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ABSTRACT 


An  implicit  enumeration  algorithm  is  developed  to  determine  the  set  of 
efficient  points  in  zero-one  multiple  criteria  problems.  The  algorithm  is  special¬ 
ized  for  the  solution  of  a  particular  class  of  facility  location  problems.  The  pro¬ 
cedure  is  complemented  with  the  use  of  the  utility  function  of  the  decision 
maker  to  identify  a  subset  of  efficient  point  candidates  for  the  final  selection. 
Computational  results  are  provided  and  discussed. 


1.  INTRODUCTION 

The  zero-one  linear  multiple  criteria  problem  (MCP)  is  written  as 
(P)  Max{Cx  :  x  €  F) 

where  F  —  {x  €  R"  :  Ax  <  b,  Xj  «■  0  or  1,  j  ■»  1,  2,  . . . .  «},  Cand  A  are  respectively  px  n 
and  m  x  n  real  matrices,  b  is  an  m- vector  and  x  is  an  w- vector.  The  solution  set  to  (P)  is 
defined  as 

EF  -  {x°  €  F  such  that  x  €  F  and  Or  >  Cx°  implies  Cx  —  Cx°). 

The  elements  of  EF  are  said  to  be  the  efficient  points  in  (P).  We  also  refer  to  EF  as  the 
efficient  set. 

The  continuous  linear  multiple  objective  problem  has  the  property  that  every  efficient 
point  maximizes  a  linear  functional,  of  the  form  X  Cx  with  X  €  Apand  X  >  0,  over  the  feasible 
set.  This  property  does  not  hold  for  problem  (P)  and  makes  its  solution  considerably  more 
difficult.  As  a  consequence,  the  obtainment  of  EF  must  rely  on  algorithms  based  on  different 
principles  than  those  developed  to  solve  the  continuous  linear  multiple  criteria  problem. 

Pasternak  and  Passy  [6]  studied  problem  (P)  for  the  case  of  two  critieria.  They  presented 
an  algorithm  based  -m  parametric  approach  combined  with  an  extension  of  Balas’  filter  method. 
Pasternak  and  Passy  applied  the  algorithm  to  the  project  selection  problem  and  reported  the 
results  in  [5].  One  of  the  authors  developed  in  (1)  and  (2)  servers]  theoretical  results  and  two 
algorithms  to  solve  (P).  The  algorithm  in  [2]  explores  the'Tact  that  the  set  of  efficient  points  in 
(P)  can  be  seen  as  the  set  of  isolated  and  end  nodes  of  a  related  directed  graph.  In  the  same 
paper  the  author  studies  the  version  of  (P)  where  the  set  Fis  given  explicitly  as  a  finite  discrete 
set.  This  last  problem  was  also  analyzed  by  Zionts  and  Wallenius  [14]  and  Zionts  (13).  Shapiro 
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(8]  mentions  several  applications  for  problem  (P)  and  relates  it  to  integer  programming  duality 
theory.  Villarreal  and  Karwan  [111,  [10]  presented  a  dynamic  programming  approach  to  solve 
problem  (P)  for  the  case  of  integer  variables  not  necessarily  restricted  to  be  zero-one.  More 
recently  Bitran  and  Lawrence  [3]  applied  the  algorithm  developed  in  [2]  to  solve  a  facility  loca¬ 
tion  problem  formulated  as  a  zero-one  multiple  criteria  problem. 

The  determination  of  the  set  EF  can  be  seen  as  a  first  step  in  solving  the  decision  problem 
implied  by  (P).  Typically,  the  decision  maker  wants  to  select  just  one  point  and  therefore  a 
second  step  must  be  undertaken  to  select  such  point.  An  alternative  approach  to  the  one  taken 
in  the  references  cited  above  is  the  utility  theory  methodology  [4],  These  two  classes  of 
methods  have  been  developed  almost  separately  although  they  are  both  interested  in  the  same 
types  of  problems.  We  believe  that  both  approaches  have  their  weaknesses.  The  efficient  point 
methods  fall  short  of  solving  the  decision  problem,  while  the  utility  theory  procedures  rely  on 
the  assumption  that  the  decision  maker’s  utility  function  can  be  obtained  with  satisfactory  accu¬ 
racy.  Whenever  this  assumption  does  not  hold,  which  seems  often  to  be  the  case  in  practical 
applications,  the  analyst  should  perform  an  extensive  sensitivity  analysis  on  the  coefficients  of 
the  utility  function.  However,  sensitivity  analysis  for  integer  programs  can  be  quite  complex 
and  may  require  that  the  problem  be  resolved  for  each  possible  value  of  the  parameters. 

In  this  paper  we  first  propose  a  new  algorithm  to  determine  the  set  of  efficient  points  in 
binary  multiple  criteria  problems  and  next  we  indicate  how  utility  theory  can  be  used  effectively 
to  select  a  solution  point.  It  is  well  known  that  if  the  decision  maker’s  utility  function  is  non¬ 
decreasing  in  each  of  the  attributes,  i.e.,  criterion,  there  is  at  least  one  element  of  EF  that  max¬ 
imizes  the  utility  function  over  F.  Based  on  this  result,  we  propose  that  first  EFbe  determined 
and  then  the  utility  function  be  assessed  and  evaluated  at  each  efficient  point.  The  advantage  of 
this  approach  is  that  since  the  set  EF  is  finite  we  can  perform  an  extensive  sensitivity  analysis 
on  the  uncertain  coefficients  of  the  utility  function  in  a  very  simple  way.  Also,  the  range  of 
variation  of  each  attribute  is  precisely  known  prior  to  the  determination  of  the  utility  function. 

The  plan  of  this  paper  is  as  follows.  In  section  2  we  develop  an  implicit  enumeration 
scheme  to  determine  the  set  EF.  In  section  3  we  report  the  results  of  computational  experi¬ 
ments  on  the  performance  of  the  algorithm  and  illustrate  how  the  implicit  enumeration  scheme 
can  be  complemented  with  the  use  of  the  utility  theory  of  the  decision  maker.  In  the  last  sec¬ 
tion  we  provide  a  comparison  of  the  algorithm  with  those  of  Villarreal  and  Karwan  [11]  and 
Bitran  [2],  and  discuss  conclusions. 

2.  AN  IMPLICIT  ENUMERATION  ALGORITHM  TO  OBTAIN  EF 

The  algorithm  is  composed  of  the  following  main  subroutines:  Finding  Bounds  for  the 
Criteria,  Fathoming  by  Bounds,  Testing  Feasibility  and  Determining  Bounds  for  the  Variables, 
Backtracking,  Branching,  and  Dominance.  These  routines  are  described  in  this  section.  After 
having  set  k  variables  at  the  value  zero  or  one,  we  are  left  with  problem  (Pk)  that  we  write  as 


(Pk)  MaxJ  T 

CDXJ  L  CUXJ  f  “  1>  2,  ,  J 

j  Jffc 

(2.1) 

Subject  to  T 

I  ^uxJ  ^  4  /■  1,  2,  ... ,  * 

jtfc 

** 

(2.2) 

Xj  «  0,  1  j€J 
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The  set  Rk  is  the  set  of  indices  of  the  variables  previously  set  at  values  one  or  zero,  J  is  the  set 
(1,2,...,  «},  J  -  Rk  is  the  set  of  indices  of  the  free  variables,  and  Silc  -  f>,  -  T  auXj  i  — 


The  block  diagram  representing  the  algorithm  is  given  in  Figure  1.  The  initial  lower 
bounds  (discussed  later  in  this  section)  and  some  of  the  feasible  points  determined  by  the  algo¬ 
rithm  are  stored  in  a  matrix  denoted  by  LB.  Each  row  of  LB  contains  the  ^vector  x1  and  the 
corresponding  p-vector  Cx'.  After  passing  by  the  Dominance  routine,  the  elements  in  LB  are 
the  efficient  points  in  (P).  The  indicator  LEV  (level  in  the  tr  e)  specifies  the  number  of  vari¬ 
ables  already  fixed  at  value  zero  or  one.  It  is  convenient  to  introduce  the  following  notation. 
For  x',  x2  €  R",  Cx2  >  Cx1  mean  Cx2  >  Cx1  with  at  least  one  strict  inequality,  x1  €  R"  is 
said  to  be  dominated  by  x2  €  R”  if  Cx2  >  Cx1.  In  this  case  we  also  say  that  x2  dominates  x1. 

2.1  Finding  Bounds  for  the  Criteria,  Fathoming  By  Bounds,  and  Inverse  Dominance 

(These  three  routines  are  intimately  related  and  are  described  jointly.) 

Among  the  first  authors  to  use  bounds  for  the  criteria  in  multiple  objective  problems  are 
Villarreal  and  Karwan  [111.  In  their  combined  dynamic  and  branch  and  bound  approach,  they 
define  a  lower  bound  for  the  criteria  as  a  p-vector  z°  —  Cx°,  where  x°  is  either  efficient  or  it  is 
dominated  by  at  least  one  efficient  solution  of  the  problem.  A  lower  bound  z°  is  used  for 
fathoming  purposes  as  follows:  If  for  some  k,  the  p-vector  z*,x,  composed  of  the  maximum  of 
each  criterion  in  (Pk),  is  dominated  by  z°,  i.e.,  z°  ^  z^x,  (Pk)  can  be  fathomed.  The  reason  is 
that  for  any  feasible  point  x  in  (Pk),  the  relation  z°  >  Cx  will  hold  and  consequently  x  9  EF. 
zhnX  is  found  by  adding  to  the  p- vector  of  criteria  computed  with  the  fixed  variables  at  stage  k, 
z*,  an  upper  bound  on  the  criteria,  UB(n  -  k),  computed  with  the  free  variables.  UB(n  —  k) 
is  obtained  by  setting  for  every  objective  /  each  free  variable  j  at  value  one  or  zero,  depending 
upon  the  sign  of  its  coefficient  ctj.  xtJ  is  set  equal  to  one  if  cv  >  0  and  to  zero  otherwise.  The 
components  of  z^,x  are  upper  bounds  for  the  corresponding  criterion  in  (P). 

Lower  bounds  can  be  generated  by  several  methods.  One  consists  in  solving  a  relaxed 
version  of  ( P )  obtained  by  ignoring  the  constraints  Ax  <  b  and  next  selecting  those  efficient 
points  that  satisfy  the  constraints  initially  deleted.  Note  that  this  set  is  a  subset  of  EF.  Another 
method,  used  by  Villarreal  and  Karwan  [11],  maximizes  a  positive  combination  of  the  criteria, 
k  Cx  with  A  >  0,  over  the  feasible  set  F.  The  points  obtained  are  efficient  in  ( P ).  In  fact,  any 
feasible  point  x  in  F  generates  a  p-vector  Cx  that  can  be  used  as  a  lower  bound.  However, 
those  determined  by  points  x  €  £Fare  the  most  effective.  In  out  algorithm  we  initially  generate 
lower  bounds  by  a  variant  of  the  last  method.  A  vector  A  €  Rf,  A  >  0  is  chosen  (see  Section 
3)  and  the  vector  AC  is  computed.  A  variable  with  the  highest  positive  coefficient  in  AC  is  set 
equal  to  one.  Next,  a  variable  with  the  remaining  highest  positive  coefficient  in  AC  is  set  equal 
to  one  and  we  continue  in  this  fashion  until  either  all  variables  with  positive  coefficients  are  set 
to  the  value  one  or  we  are  prevented  from  continuing  bemuse  we  have  reached  an  infeasible 
solution.  We  illustrate  this  procedure  with  an  example. 

EXAMPLE  1:  Assume  that  AC  «»  (3,  2,  1,  -2,  -1)  and 

F  »  [x  €  Rs  :  xj  +  x2  +  5xj  +  x4  +  3xs  <6,  Xj  -  0,  1  j  -  1,  2,  3,  4,  5). 
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Thus,  and  x2  are  set  equal  to  one  and  all  other  variables  are  set  equal  to  zero.  If  we  denote 
the  feasible  solution  obtained  by  x°,  Cx°  is  used  as  a  lower  bound.  Early  experiments  per¬ 
formed  to  test  the  effectiveness  of  some  methods  to  generate  lower  bounds  suggested  that  the 
variant  used  presents  a  satisfactory  tradeoff  between  the  quality  of  the  bound  and  the  time 
spent  to  compute  it. 

For  small  values  of  k  the  values  of  the  upper  bounds  2^  are  fairly  "large”  and  the  bound¬ 
ing  of  the  criteria  is  not  effective.  For  this  reason  we  have  explored  the  alternative  of  using  the 
subroutine  Fathoming  by  Bounds  for  k  fin)  where  fin)  is  a  pre-specified  number.  The 
computational  results  obtained  varying  fin)  tat  presented  in  the  next  section. 


Whenever  we  reach  the  end  of  a  branch  in  the  tree  enumeration  method  {LEV  —  n)  and 
find  a  point  x1  (note  that  x1  is  such  that  Ax1  £  b  and  C r1  >  r°).  We  compare  it  with  the  ele¬ 
ments  in  matrix  LB.  If  some  element  x'  of  LB  dominates  x*(  Cx‘  >  Cx1),  we  discard  x1.  Oth¬ 
erwise,  it  is  simply  added  to  the  matrix.  We  call  this  test  Inverse  Dominance.  The  experi¬ 
ments  performed  indicated  that  it  is  not  worthwhile  to  also  test  if  the  feasible  point  dominates 
elements  in  matrix  LB.  Rather,  it  is  more  effective  to  execute  the  subroutine  Dominance  at 
the  end  of  the  algorithm. 

2.2  Feasibility  and  Bounding  for  Variables 

The  extended  Geometric  Definition  Method  of  Zionts  [12]  is  applied  in  this  paper  for  the 
multicriteria  case.  The  objective  is  to  find  upper  and  lower  bounds  on  some  variables,  those 
most  likely  to  be  bounded.  If  for  some  variable  j,  the  upper  bound  uy  is  less  than  one,  the 
variable  is  set  at  value  zero  and  if  a  lower  bound  hj  greater  than  zero  is  found,  the  variable  is 
set  at  value  one.  The  bounds  are  found  by  setting  the  remaining  variables  at  zero  or  one 
according  to  the  sign  of  their  coefficients  in  the  constraint  matrix  A. 

The  Mock  diagram  of  Figure  2  illustrates  the  subroutine  Testing  Feasibility  and  Determin¬ 
ing  Bounds  for  the  Variables.  Each  constraint  t  is  considered  differently,  depending  upon  the 
sign  of  its  right-hand-side  (RHS)  Sf  at  stage  k. 

If  the  RHS  is  negative,  we  test  to  see  if  Sf  <  SANf  iSANf  is  the  sum  of  the  negative 
coefficients  of  the  free  variables  of  constraint  /  at  stage  k).  In  this  case  iPk)  is  infeasible  and 
the  algorithm  backtracks.  If  5*  “  SANf,  all  negative  variables  are  set  at  one  and  all  positive 
variables  at  zero.  Finally,  if  Sf  >  SANf  the  lower  bound  hj  is  computed  for  a  variable  with  the 
most  negative  coefficient  and  the  upper  bound  Uj  is  computed  for  a  variable  with  the  most  posi¬ 
tive  coefficient. 

If  the  RHS  is  positive,  we  test  to  3ee  if  constraint  /  satisfies  the  condition  Sf  £  SANf  + 
SAPf  where  SAPf  is  the  sum  of  all  positive  coefficients  of  the  free  variables  of  row  /at  stage  k. 
In  that  case  we  continue  to  the  next  constraint.  Otherwise,  we  find  the  upper  bound  for  a  most 
positive  variable. 

This  simplified  method  of  finding  bounds  for  only  the  most  positive  and  most  negative 
variables  of  every  row  is  analyzed  in  Rivera  [7],  A  procedure  for  finding  all  the  variables  that 
need  to  be  tested  for  bounding  is  also  discussed. 
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Figure  2.  Subroutine  testing  feasibility  and  determining  bounds  for  the  variables 
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2.3  Backtracking 

The  backtracking  subroutine  is  very  simple.  The  algorithm  moves  backward  in  the  tree 
until  a  variable  which  has  taken  only  one  of  the  two  possible  values  is  found.  The  variable  is 
set  at  the  other  value  and  the  algorithm  proceeds  as  in  Figure  1.  Variables  whose  values  are 
directly  set  in  the  Feasibility  and  Bounding  for  Variables  subroutine  are  skipped  over  in  the 
backtracking  procedure  since  their  complementary  values  are  not  worth  investigating. 

2.4  Branching 

The  branching  rules  are  designed  to  drive  the  problem  towards  infeasible  parts  of  the  tree 
so  that  the  subroutine  Feasibility  and  Bounding  for  Variables  just  described  is  more  effective. 

The  block  diagram  of  Figure  3  shows  the  branching  rule  strategy.  Only  one  constraint  is 
analyzed  each  time.  If  there  are  some  constraints  with  negative  RHS's,  the  algorithm  chooses 
one  with  Min  (S?  -  SAN?)  and  branches  on  a  variable  x.  with  most  negative  coefficient,  setting 

i 

xq  at  value  zero. 


In  the  absence  of  constraints  with  negative  RHS,  the  algorithm  attempts  to  create  one.  In 
this  case  a  constraint  with  the  least  slack,  Min  (S?  -  SAN?  —  SAP ?),  is  chosen  and  a  variable 

with  the  largest  positive  coefficient  is  set  equal  to  one. 

2.5  Dominance 

Before  entering  the  Dominance  subroutine,  matrix  LB  contains  the  lower  bound  used. 
These  include  the  initial  lower  bound  determined  and  all  feasible  points  not  eliminated  by  the 
Inverse  Dominance  subroutine.  The  set  of  efficient  points  in  (/*)  is  contained  in  the  set  of  ele¬ 
ments  of  LB,  Therefore,  LB  may  contain  dominated  points.  In  order  to  identify  these  point, 
pair-wise  comparisons  between  all  elements  in  LB  were  performed. 

Another  procedure  tested  was  as  follows.  The  elements  in  LB  were  kept  ordered  by 
decreasing  value  of  the  sum  of  their  components.  Each  feasible  point  x1  determined  by  the 
algorithm  was  tested  to  see  if  it  dominated  or  was  dominated  by  elements  in  LB.  We  have  used 
the  fact  that  a  necessary  condition  for  x  to  dominate  y  is 

Although  with  this  procedure,  at  the  end  of  the  algorithm,  matrix  LB  contained  the  efficient 
points  of  the  problem  (without  the  need  for  the  Dominance  subroutine),  it  did  not  improve  the 
computation  time. 

2.6  A  Class  of  Facility  Location  Problems 

We  conclude  this  section  specializing  the  implicit  enumeration  algorithm  for  a  class  of 
facility  location  problems  of  the  form 
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Select  a  Pow  i 
with 

Min(Sk-SANk) 

1  k 
All  i:  S*T  <  0 


Select  a  Row  i 
with 

Min(Sk-SANk-SAPk) 

1  k 

All  i:  S*  >  0 


Select  a 
Variable  x 

q 

such  that 

1*1,1  -  Max  l*ij 

a«  ' 0 


Set  x  +0 

q 


Return 


Select  a 
Variable  x 
such  thatP 

a  »  Max  a . , 
iP  i; 

aij >0 


_ i 

/ 

Set 

x  -*-1 

P 

\ 

( 

Return 

Figure  3.  Subroutine:  brinching 
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(FLP)  Max  Or 
subject  to  Ax  ^  b 

(2.3)  £  <  1  r  »  1 ,  2,  . . .  ,  q 

ju, 

Xj  -  0  or  1  j  «■  1,  2,  . . .  ,  n 

where  Ju  J2,  ....  Jq  form  a  partition  of  J  —  {1,  2,  . . .  ,  n)  and  C,  A,  b,  and  x  are  as  in  ( P ). 

The  constraints  (2.3)  state  that  at  most  one  element  of  each  set  Jr,  r  ■*  1,  2 . q  can  be 

part  of  an  efficient  point  of  (FLP).  Such  conditions  are  common  in  practical  settings,  mainly 
when  the  sets  J,  represent  heterogeneous  geographical  regions  with  laws  and  regulations  that 
differ  significantly.  Bitran  and  Lawrence  [31  report  an  application  of  (FLP).  The  special  struc¬ 
ture  of  constraints  (2.3)  permits  a  significant  simplication  in  the  branch  and  bound  algorithm 
described. 

In  (FLP)  a  maximum  of  q  variables  can  be  set  to  one.  This  fact  is  exploited  in  the 
Fathoming  by  Bounds  subroutine.  When  UB(J  -  k)  is  determined,  it  is  not  necessary  to  set 
all  the  positive  free  variables  to  one,  but  only  the  q  -  w  with  the  largest  positive  coefficients, 
where  w  is  the  number  of  variables  fixed  at  value  one.  We  also  modified  subroutines  Feasibil¬ 
ity  and  Bounding  for  Variables  and  Fathoming  by  Bounds  to  exploit  the  fact  that  at  most  one 
variable  among  those  in  which  indices  are  elements  of  each  Jr  may  be  one. 

The  obtainment  of  lower  bounds  for  the  criteria  is  also  slightly  changed  for  this  special 
structure.  After  computing  the  vector  kC,  k  >  0,  k  6  Rp,  the  algorithm  chooses  a  variable  in 
J i,  the  one  with  the  largest  positive  coefficient  in  kC  and  sets  it  equal  to  one.  Next,  the 
variable  in  J2  with  the  largest  positive  coefficient  in  k  C  is  set  equal  to  one.  The  procedure  con¬ 
tinues  in  this  fashion  until  either  one  variable  for  every  Jr,  r  —  1,  2 . q,  is  set  to  the  value 

one  or  an  infeasibility  is  reached. 

3.  COMPUTATIONAL  RESULTS 

In  this  section  we  present  computational  results  on  the  performance  of  the  implicit 
enumeration  algorithm  of  Section  2.  We  have  attempted  to  analyze  the  impact  of  the  following 
parameters,  the  number  of  lower  bounds,  number  of  objective  functions,  magnitude  of  the 
RHS,  number  of  variables,  and  the  stage  in  which  to  start  the  bounding  of  the  objectives.  Both 
problems  (P)  and  (FLP)  are  studied  in  these  experiments.  We  conclude  this  section  by  illus¬ 
trating  the  combination  of  the  efficient  points  and  utility  theory  approaches. 

All  problems  tested  were  randomly  generated  with  the  coefficients  in  matrices  A  and  C  in 
the  interval  (0,100],  The  RHS  of  each  constraint  was  set  at  a  fixed  proportion  K  (K  *»  .25,  .50, 
.75)  of  the  sum  of  the  corresponding  coefficients  in  matrix  A.  The  reason  for  this  selection  of 
data  is  to  permit  comparison  with  other  works  in  the  liturature  [1],  [2],  and  [11].  The  results 
obtained  with  the  special  structure  of  the  particular  facility  location  problem  (FLP)  are 
significantly  superior  to  the  corresponding  results  for  problem  (P).  All  computations  were 
made  with  the  computer  PRIME  400.  The  computer  programs  were  written  in  FORTRAN. 
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The  following  additional  notations  will  be  used: 

NTB  —  number  of  problems  ( Pk )  fathomed  for  k  *  1,  2,  ....  n. 

NPA  —  number  of  feasible  points  after  bounding  (i.e.,  number  of  end  branch  points). 

NEP  —  number  of  efficient  points  in  ( P ). 

Q  —  number  of  feasible  points  after  inverse  dominance. 

T  —  Total  central  processing  unit  (c.p.u.)  time  in  seconds. 

BT  -  c.p.u.  seconds  spent  in  bounding  the  objectives. 

3.1  Problem  iP) 

The  impact  of  the  number  of  lower  bounds  and  the  stage  at  which  to  start  the  bounding 
procedure  were  initially  explored. 

As  did  Villarreal  and  Karwan  [111  for  their  dynamic  programming  algorithm,  we  tested 
several  problems  with  one,  two,  and  three  lower  bounds.  The  values  of  A  €  R3  chosen  were 
(1,1,1),  (3,1,1)  and  (1,1,3).  The  results  were  highly  dependent  upon  the  size  of  the  problem. 
For  problems  with  up  to  fourteen  variables,  one  lower  bound  performed  better;  for  fifteen  to 
twenty  variables,  two  lower  bounds  gave  the  best  results;  and  for  more  than  twenty  variables, 
three  lower  bounds  showed  superior  performance.  In  Table  1  we  present  a  sample  of  the 
experiments  performed  for  problems  of  type  (/*).  The  c.p.u.  times  correspond  to  the  mean  and 
standard  deviation  of  a  sample  of  five  problems. 


TABLE  1  —  Problem  (P),  c.p.u.  Times, 
p  —  3,  n  —  14,  and  m  —  4 


One  Lower 
Bound 

Two  Lower 
Bounds 

m 

Median 

33.12 

41.04 

Range 

2S.20-66.60 

28.44-67.68 

n 

Median 

35.64 

56.88 

Range 

18.72-136.44 

42.48-163.44 

EB 

Median 

15.12 

19.08 

Range 

11.88-31.68 

14.76-34.92 

In  Figures  4a  and  4b  we  present  a  sample  of  typical  results  obtained  when  the  bounding 
procedure  starts  at  different  levels,  i.e.,  after  having  fixed  a  certain  number  of  variables.  The 
computational  experiments  indicated  that  the  bounding  procedure  was  not  effective  when  intro¬ 
duced  at  levels  smaller  than  n  -  3,  i.e.,  fin)  <  n  -  3.  Figures  4a  and  4b  illustrate  the  case 
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where  the  bounding  procedure  was  started  at  each  of  the  levels  indicated.  The  decrease  in 
computation  time  between  level  1  and  level  9  represent  the  savings  for  not  using  the  bounding 
procedure.  For  f(n)  >  n  -  3,  NTB  and  T  start  to  increase  because  the  bounding  routine 
fathomed  a  large  number  of  problems  (/*).  The  points  in  both  figures  for  LEVEL  •  12 
correspond  to  not  using  the  Fathoming  by  Bounds  subroutine. 

Tables  2  and  3  show  the  results  obtained  with  problems  of  type  ( P )  with  varying  numbers 
of  objectives,  variables  and  factor  K.  For  each  set  of  parameters,  five  problems  were  solved. 
The  computational  results  suggest  that  increasing  the  number  of  objectives  increases  the  com¬ 
putation  time.  Essentially,  the  reason  is  that  the  number  of  efficient  points  also  increases. 
Problems  with  K  —  .25  tend  to  be  easier  to  solve  because  the  number  of  feasible  points  is 
lower  than  for  K  —  .50  and  K  —  .75.  Problems  with  K  -  .25  are  also  favored  by  the  design  of 
the  algorithm  that  branches  toward  infeasibility.  By  branching  in  the  opposite  way  we  can 
obtain,  for  problems  with  K  —  .75,  essentially  the  same  results  observed  with  K  —  .25.  The 
same  effect  occurs  with  other  algorithms  in  the  litureature  [2],  Therefore  the  hardest  problems 
to  solve  in  terms  of  c.p.u.  time  are  the  problems  with  K  ”  .50.  An  intuitive  justification  for 
these  observations  is  the  fact  that  for  K  —  .25  [ft  —  .75]  most  efficient  points  tend  to  be  close 
to  (0,  0,  ...  ,  0)  €  /?"[(!,  ....  ,1)  €  /?"].  To  visualize  the  impact  of  the  number  of  variables 
on  the  computation  time,  we  have  plotted  in  Figure  5  the  mean  c.p.u.  time  for  the  problems 
with  p  —  3,  m  —  8,  K  -  .25  of  Table  3.  As  expected,  the  computation  time  increases 
exponentially. 

3.2  Facility  Location  Problems 

In  Table  4  we  present  computational  results  corresponding  to  problems  of  type  (FLP).  A 
sample  of  five  problems  was  solved  for  each  set  of  parameters.  From  the  nine  constraints,  five 
are  of  the  special  structure  £  <  1 .  The  comments  made  for  Problem  (P)  with  different  values 

xj 

of  K  also  apply  for  the  results  in  Table  4.  However,  the  times  to  solve  the  facility  location 
problems  are  considerably  smaller  than  those  needed  for  Problems  ( P ).  Of  course  the  reason  is 
that  (FLP)  has  a  special  structure  that  permits  a  specialization  of  the  algorithm.  In  Figure  6  the 
computation  times  obtained  for  ( P )  and  (FLP)  are  compared.  The  solid  line  corresponds  to 
the  curve  of  Figure  5,  while  the  dashed  line  represents  the  problem  of  Table  4  with  p  ■*  3, 
m  -  9,  and  K  -  .25. 


TABLE  2  -  Problem  (P) 


7Xsec) 

NEP 

/>“ 5,  8,  K—.25,  n~  10 

Median 

0.31 

4 

Range 

0.24-0.42 

2-9 

P" 5,  m~8,  K“,25,  n*“  14 

Median 

1.59 

17 

Range 

1.14-1.73 

10-24 

p~5,  8,  K*,25,  n«"18 

Median 

12.97 

70 

Range 

10.15-13.57 

37-91 
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TABLE  3  —  Problem 


7Tsec) 


Median 

2.02 

10 

Range 

1.35-2.23 

4-25 

p— 3,  m— 8,  K— .50,  «— 10 

Median 

1.06 

10 

Range 

0.93-1.50 

7-14 

p—  3,  m—8,  AT— .25,  /?— 10 

Median 

0.22 

5 

Range 

0.17-0.25 

1-6 

p— 3,  m—8,  AT—  .75,  n— 14 

Median 

15.63 

20 

Range 

13.80-20.08 

11-55 

p— 3,  m—8,  Af— .50,  n— 14 

Median 

14.70 

22 

Range 

8.94-17.34 

15-62 

p— 3,  m—8,  AT—.25,  14 

Median 

1.08 

5 

Range 

0.73-1.25 

3-11 

p— 3,  m—8,  AT™. 75,  n— 18 

Median 

223 

36 

Range 

194-355 

26-62 

p— 3,  m—8,  A". 50,  w— 18 

Median 

195 

36 

Range 

138-216 

22-54 

p— 3,  m—8,  /C*™ .25,  n— 18 

Median 

8.00 

20 

Range 

6.39-8.20 

13-41 

p— 3,  m—8,  A—. 25,  n— 20 

Median 

16.76 

14 

Range 

14.80-21.30 

12-36 

Median 

77.30 

24 

Range 

66.70-104.40 

16-60 

p— 3,  m—8,  AC— .25,  n— 25 

Median 

229 

50 

Range 

170-290 

28-76 
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p— 3,  m— 9,  Km 


p— 3,  m— 9,  AT" 
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TABLE  4  -  Problem  (FLP) 


— 

7Tsec) 

NEP 

-.75,  n-10 

Median 

1.91 

14 

Range 

1.28-2.11 

8-22 

*.50,  n— 10 

Median 

1.83 

8 

Range 

0.98-2.07 

1-12 

•.25,  n-10 

Median 

0.58 

5 

Range 

0.55-0.71 

3-8 

■.75,  n-15 

Median 

5.61 

28 

Range 

3.08-7.05 

8-46 

-.50,  n-15 

Median 

7.17 

28 

Range 

3.33-9.39 

8-47 

■.25,  n-15 

Median 

3.78 

7 

Range 

3.45-3.86 

4-11 

■.75,  »— 20 

Median 

14.93 

42 

Range 

7.51-30.16 

13-51 

-.50,  n— 20 

Median 

13.60 

42 

Range 

7.35-32.35 

13-44 

■.25,  n— 20 

Median 

13.18 

23 

Range 

10.51-19.22 

10-48 

■.25,  n— 25 

Median 

37.83 

58 

Range 

23.95-66.75 

20-80 

■.25,  n— 30 

Median 

111.80 

74 

Range 

65.64-118.81 

43-86 

■.25,  n— 35 

Median 

165.96 

67 

Range 

88.22-174.47 

50-110 
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3.3  Combining  the  Efficient  Point  and  Utility  Theory  Approaches 

The  following  utility  function  for  each  criterion  /  was  considered  for  the  purpose  of  illus¬ 
tration  of  the  procedure. 

m,(x)  -»  a((c,  -  e~b‘x). 

The  utility  function  of  the  decision  maker  was  assumed  to  be  additive,  i.e., 

3 

u  -  k\U\  +  k2u2  +  kiUi  with  2^ 

The  parameters  were  chosen  as  follows.  k\,  k2,  and  bj  were  generated  randomly,  while  a,  and 
c,  were  determined  such  that  the  maximum  and  minimum  values  of  u((-)  be,  respectively,  one 
and  zero.  In  a  real  life  application  the  parameters  bh  klf  and  k2  must  be  assessed  using,  for 
example,  the  procedures  in  [4],  Other  forms  of  utility  function  would  not  alter  the  methodol¬ 
ogy  described  here  as  long  as  they  are  monotonic  nondecreasing  in  each  criterion. 

As  mentioned  earlier,  the  method  we  propose  consists  in  first  determining  the  set  of 
efficient  points  and  next  performing  an  extensive  sensitivity  analysis,  of  the  assessed  parame¬ 
ters,  of  the  utility  function  of  the  decision  maker  over  the  set  EF.  We  recall  that  under  the 
mild  condition  that  the  utility  function  be  monotonic  nondecreasing  in  each  objective,  it  will 
attain  its  maximum  over  the  feasible  set  at  least  one  efficient  point. 

In  our  example,  the  sensitivity  analysis  was  performed  by  varying  the  selected  coefficients 
in  a  ±10%  interval  around  their  original  values.  The  results  are  reported  in  a  matrix  form 
when  only  two  parameters  are  varied.  Table  5  illustrates  such  a  situation.  The  parameters  kt 
and  k2  were  varied  incrementally  by  .01  kt  over  the  intervals  [.90  Art,  1.10  kx]  and  [.90  k2, 
1.10  k2]  and  the  resulting  utility  function  value  was  computed  for  all  points  in  EF.  The  code 
corresponding  to  the  point  that  maximizes  the  utility  of  the  decision  maker  is  given  in  the 
matrix.  We  have  indicated  in  the  table  the  regions  where  a  given  efficient  point  remains 
optimal. 

When  more  than  two  parameters  are  varied  at  the  same  time,  the  matrix  representation  is 
not  applicable.  In  this  case,  the  efficient  points  that  maximize  the  utility  function  for  some 
combination  of  the  parameters  are  indicated  together  with  the  percentage  of  cases  tested  where 
they  were  the  preferred  choice  of  the  decision  maker.  In  Table  6  we  have  given  an  example  of 
this  procedure.  Five  parameters  were  varied,  bt,  b2 ,  b2,  k\,  and  k2.  Random  numbers  in  the 
intervals  [.9  bh  1.10  b,  1  and  {.9  kJt  1.10  kjJ,  i  —  1,2,3  and  j  —  1,2  were  generated  and  the 
resulting  utility  function  was  computed  for  all  elements  in  EF.  The  percent  of  cases  in  which 
each  efficient  point  is  optimal  is  indicated  in  the  table.  In  this  case,  for  example,  efficient  point 
number  9  had  the  highest  value  of  the  utility  function  is  sixty  nine  percent  of  the  cases. 

Both  problems  in  Tables  S  and  6  are  of  the  facility  location  type.  Tables  S  and  6  suggest 
that  the  number  of  alternatives  to  be  presented  to  the  decision  maker  when  the  two  approaches 
are  combined  is  likely  to  be  manageable.  In  this  way,  the  analyst  does  not  have  the  unpleasant 
task  of  presenting  the  entire  set  of  efficient  points  to  the  decision  maker  and  at  the  same  time 
he  can  draw  on  the  strengths  of  utility  theory. 
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TABLE  5:  Utility  Function  Sensitivity  Analysis  Varying  k\  and 

p  n  a  K 

3  25  6  0.25 


UTILITY  FUNCTION  COEFFICIENTS 


kl  k2 

0.60  0.19 


UTILITY  FUNCTION  SENSITIVITY  ANALYSIS 


0.9k,  k  1.1k 

1  1  1 


42 

42 

42 

42 

42 

42 

42 

42 

46 

46 

46 

46 

46 

46 

46 

46 

46 

46 

46 

46 

46 

42 

42 

42 

42 

42 

42 

42 

42 

46 

46 

46 

46 

46 

46 

46 

46 

46 

46 

46 

46 

46 

42 

42 

42 

42 

42 

42 

42 

Tf? 

46 

46 

46 

46 

46 

46 

46 

46 

46 

46 

46 

46 

46 

42 

42 

42 

42 

42 

421 

"46 

46 

46 

46 

46 

46 

46 

46 

46 

46 

46 

46 

46 

46 

|48 

42 

42 

42 

42 

42 

,  46 

46 

46 

46 

46 

46 

46 

46 

46 

46 

46 

46 

46 

46 

46 

|4B 

42 

42 

42 

42 

42 

46 

46 

46 

46 

46 

46 

46 

46 

46 

46 

46 

46 

46 

46  48 

48 

42 

42 

42 

42 

46 

46 

46 

46 

46 

46 

46 

46 

46 

46 

46 

46 

46 

46 

48 

48 

48 

42 

42 

42 

|  46 

46 

46 

46 

46 

46 

46 

46 

46 

46 

46 

46 

46 

46 

46 

1 48 

48 

48 

42 

42! 

46" 

46 

46 

46 

46 

46 

46 

46 

46 

46 

46 

46 

46 

46 

46 

48 

48 

48 

48 

42 

[46 

46 

46 

46 

46 

46 

46 

46 

46 

46 

46 

46 

46 

46 

46 

J?6 

,48 

48 

48 

48 

46 

46 

46 

46 

46 

46 

46 

46 

46 

46 

46 

46 

46 

46 

46 

46 

■48 

48 

48 

48 

48 

46 

46 

46 

46 

46 

46 

46 

46 

46 

46 

46 

46 

46 

46 

46 

1 48 

48 

48 

48 

48 

48 

46 

46 

46 

46 

46 

46 

46 

46 

46 

46 

46 

46 

46 

46 

j  48* 

48 

48 

48 

48 

48 

48 

46 

46 

46 

46 

46 

46 

46 

46 

46 

46 

46 

46 

46 

46 

Us 

48 

48 

48 

48 

48 

48 

46 

46 

46 

46 

46 

46 

46 

46 

46 

46 

46 

46 

46 

i4T 

48 

48 

48 

48 

48 

48 

48 

46 

46 

46 

46 

46 

46 

46 

46 

46 

46 

46 

46 

1 48 

48 

48 

48 

48 

48 

48 

48 

48 

46 

46 

46 

46 

46 

46 

46 

46 

46 

46 

46 

nra" 

48 

48 

48 

48 

48 

48 

48 

48 

48 

46 

46 

46 

46 

46 

46 

46 

46 

46 

46  | 

'4fT 

48 

48 

48 

48 

48 

48 

48 

48 

48 

48 

46 

46 

46 

46 

46 

46 

46 

46 

46 

"4F 
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43 
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TABLE  6  —  Output  of  the  Algorithm 

P  n  m  K 

3  35  6  0,2 5 

Q  NF'A  NTE< 

377  824  2867 


EFFICIENT  POINTS 


1 

430.51 

307.94 

334.08 

2 

354.14 

426.28 

387.93 

3 

425.10 

416.42 

274.17 

4 

426.28 

387.93 

392.48 

S 

437.69 

328.42 

373.54 

6 

270.06 

320.49 

428.53 

7 

253.67 

271.97 

437.69 

8 

429.71 

329.32 

406.60 

9 

434.27 

376.52 

402.53 

10 

307.12 

434.27 

290.54 

11 

431.09 

395.85 

382.81 

12 

427.67 

351.24 

406.72 

13 

271.97 

437.69 

328.42 

14 

428.16 

203.54 

413.49 

15 

353.62 

431.09 

395.85 

16 

391.50 

419.10 

321.10 

17 

413.22 

417.93 

413.22 

18 

355.64 

425.10 

416.42 

19 

417.93 

413.22 

427.67 

20 

384.50 

307.12 

434.27 

-b  x 

ui  “  al(cl  ‘  •  >  1  *  1.2.3 


UTILITY  FUNCTIONS 
u 


klux  +  k2u2  +  k3u3 


3 

£ 

1-1 


ki  ' 


COEFFICIENTS  OF  UTILITY  FUNCTION 

a,  c 


1 

5.903 

bl 

-0.002 


“2 

3.6.'' 


3 

7.306 


-0.003  -0.001 


1 

0 . 660 
kl 

0.600 


c2 

0.517 

k2 

0.192 


0.696 


COEFFICIENTS  bl  .  b2  . 


+  AND 


>.  *1  OHO  k2  ARE  RANDOMLY  SELECTED  PETWEEN 
10X  OF  THE  VALUES  APOVE.  FUR  PURPOSE  OF  SENSITIVITY. 


IITH  ITY  FUNCTION  SENSITIVITY  rtflr.l  TOTS: 

PERCENT  OF  CASES  EACH  IITICILNl  TUJNI  IS  OP  I  INAL. 


ErriCIKN I  N-S 
fklQulncy  I 


1  3  4  5  6  7 

0  0-10  0  0  o 


0  V  ID  n  C  13  14  15  16  17  18  19  20 
0  69  0  1/  0  0  0  0  0  0  0  10  0 
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4.  COMPARISONS  WITH  EXISTING  ALGORITHMS,  AND  CONCLUSIONS 

A  direct  comparison  of  the  c.p.u.  times  obtained  with  other  results  available  in  the  litera¬ 
ture  is  not  straightforward,  mainly  because  the  corresponding  experiments  were  undertaken  on 
different  computers.  However,  we  report  for  the  first  time  results  on  the  number  of  efficient 
points  and  c.p.u.  times  for  problems  with  up  to  25  variables  in  the  general  structure  problem 
(P)  and  up  to  35  variables  in  the  special  structure  problem  (FLP)  with  8  constraints. 

Table  7  contrasts  our  results  with  those  of  Villarreal  and  Karwan  [11]  and  Bitran  [2]. 


TABLE  7  -  Comparison  of  Solution  Times  (c.p.u.  Seconds)  for  Problems 
(P)  of  Size  m  ■  4,  n  ■  10,  and  p  —  3 


ID 

Implicit 

Enumeration* 

Villarreal  and  Karwan  [11]** 

Bitran  [2]*** 

0.75 

problem  1  1.39 

problem  2  0.78 

5.55 

0.63 

B 

problem  1  1.13 

problem  2  0.73 

2.64 

1.50 

0.25 

problem  1  0.30 

problem  3  0.17 

0.49 

1.11 

'Largest  and  smallest  solution  times  in  a  sample  of  5  problems  using  a  computer  PRIME  400. 
"Smallest  solution  time  reported  by  Villarreal  and  karwan  with  a  computer  CDC  6400. 
‘"Smallest  solution  time  reported  by  Bitran  on  a  computer  Burroughs  B6700. 


The  combination  of  the  efficient  point  and  utility  theory  approaches  that  we  have  explored 
draws  on  the  strengths  of  both  methods.  The  computational  results  conducted  suggest  that  the 
methodology  developed  reduces  the  task  of  the  decision  maker  to  choosing  among  a  small 
number  of  alternatives.  The  results  illustrated  in  Tables  5  and  6  are  typical  of  those  obtained  in 
a  more  extensive  computational  analysis  that  we  have  performed. 

The  lower  computation  times  observed  for  problems  of  type  (FLP)  suggest  that  there 
might  be  much  to  gain  by  developing  methods  for  special  structures.  Another  topic  to  be 
explored  is  the  comparison  in  practical  settings  of  the  approach  proposed  in  this  paper  with  the 
Filtering  Method  introduced  by  Steuer  [9].  The  purpose  of  the  Filtering  Method  is  to  avoid 
presenting  the  whole  efficient  set  to  the  decision  maker  by  selecting  one  representative  of  each 
cluster. 
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ABSTRACT 

Suppose  one  object  is  hidden  in  the  A-ih  of  n  boxes  with  probability  p(k). 
The  boxes  are  to  be  searched  sequentially.  Associated  with  the  >lh  search  of 
box  A  is  a  cost  c  (j. k )  and  a  conditional  probability  q(j.k)  that  the  first  j  -  1 
searches  of  box  k  are  unsuccessful  while  the  >lh  search  is  successful  given  that 
the  object  is  hidden  in  box  k. 

The  problem  is  to  maximize  the  probability  that  we  find  the  object  if  we  are 
not  allowed  to  offer  more  than  L  for  the  search.  We  prove  the  existence  of  an 
optimal  allocation  of  the  search  effort  L  and  state  an  algorithm  for  the  con¬ 
struction  of  an  optimal  allocation.  Finally,  we  discuss  some  problems  concern¬ 
ing  the  complexity  of  our  problem. 


1.  INTRODUCTION 

An  oblivious  person  loses  his  glasses  again  and  again  in  his  own  house  which  consists  of  n 
rooms.  He  has  to  decide  how  he  will  search  for  the  glasses.  Moreo  '^r,  he  has  to  notice  that  he 
may  overlook  the  glasses  when  he  searches  the  right  room.  In  order  to  search  "optimally”  he 
estimates  the  following  data:  p{k)  for  A:  €  B  -  {1 ,  . . .  ,  n),  the  a  priori  probability  that  he  has 
lost  the  glasses  in  the  Ar-th  room,  c(j,k)  6  R+  for  j  6  M  and  k  €  B,  the  length  or  cost  of  the 
>th  search  of  room  A:  and  q(j.k)  for  j  €  M  and  k  £  B,  the  conditional  probability  that  the  first 
j  -  1  searches  of  room  k  are  unsuccessful  while  the  >th  search  of  room  k  is  successful  given 
that  the  glasses  have  been  lost  in  room  Ar.  Since  these  events  exclude  each  other 
£  q(j.k)  <  1. 

i  <  j  <  — 

The  length  of  an  inspection  surely  depends  on  the  searched  room  but  it  is  also  reasonable 
to  assume  that  it  depends  on  the  number  of  previous  searches.  On  the  one  hand  some  parts  of 
the  room  are  clearly  arranged  so  that  we  have  to  search  them  only  once,  on  the  other  side  we 
open  the  cupboards  not  before  the  fourth  search  of  a  room. 

More  important  situations  where  we  can  apply  this  search  problem  are  the  following:  the 
search  for  a  defect  in  a  large  system,  the  search  for  oil  or  a  mineral  in  different  regions  and  the 
search  for  a  sunken  ship  where  we  partition  the  ocean  to  map  squares.  In  the  following  we 
shall  talk  about  the  search  for  an  object  hidden  in  one  of  n  boxes. 

We  have  to  decide  what  we  intend  by  an  optimal  search.  We  may  try  to  minimize  the 
expected  cost  of  a  successful  search.  This  problem  has  been  solved  by  the  author  [6).  But 
often  we  are  not  able  to  search  for  the  hidden  object  until  it  is  found.  If  we  want  to  visit  the 
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theater  the  time  for  the  search  of  our  glasses  is  limited.  Looking  for  a  sunken  ship  we  may 
hope  only  for  a  time  period  of  length  T  that  we  save  survivors.  A  company  digging  for  a 
mineral  has  only  limited  funds.  The  best  we  can  do  in  these  situations  is  to  maximize  the  pro¬ 
bability  of  a  successful  search  without  offering  more  than  a  given  limit  L  €  . 

For  this  problem  we  do  not  have  to  decide  in  which  order  we  will  search  the  boxes  but 
only  how  often  we  want  to  search  each  box.  Each  function  a  :  B  — 1 >  N0  -  Not/  1°°}  is  an  allo¬ 
cation:  search  box  k  for  a(k)  times  ( k  €  B).  If  we  use  an  allocation  a  we  have  to  be  able  to 
pay  all  the  prescribed  searches.  The  cost  of  these  searches  amounts  to  C(a )  £  ]£ 

kiB  l^j<a(k) 

c(J,k)  (e(°°,k)  -  0).  Because  we  shall  consider  only  allocations  whose  cost  does  not  exceed 
the  given  limit  L  we  call  an  allocation  a  /.-admissible  iff  C(«)  <  f.  We  find  the  object  during 
the  >th  search  of  box  k  iff  it  is  hidden  there;  we  have  overlooked  it  for  j  —  1  times  and  do  not 
overlook  it  during  the  >th  search  of  k.  The  probability  of  this  event  is  obviously  p(k)q(j,k). 
Therefore  P(a),  the  probability  of  the  success  of  our  allocation,  comes  to 

P(a)  £  £  p(k)qU.k)  (q(°°,k)  -  0). 

kkB  1  <  a<*> 

a  is  /.-optimal  if  it  is  /.-admissible  and  no  other  /.-admissible  allocation  has  a  larger  proba¬ 
bility  of  success.  If  we  have  computed  an  /.-optimal  allocation  a  we  may  use  the  results  of  [6] 
to  decide  in  which  order  we  should  realize  the  searches  which  are  prescribed  by  a. 

Kadane  13)  has  investigated  this  problem.  He  assumes  that  e(j.k)  :«  p{k)q(j,k)c(j,k)~ 1 
is  decreasing  for  each  k  as  a  function  of  j  and  states  an  algorithm  which  computes  an  /.-optimal 
allocation  if  it  stops.  Kadane  notes  that  his  algorithm  stops  if  c  is  bounded  below  by  a  constant 
«  >  0.  This  is  not  surprising  because  in  that  case  the  number  of  L-admissible  allocations  is 
finite.  We  give  a  generalization  of  Kadane's  algorithm  and  we  prove  that  it  stops  for  each 
search  problem  and  that  it  computes  always  an  /.-optimal  allocation.  Arkin  [2]  and  Stone  [51 
deal  with  even  more  special  cases  than  Kadane. 

We  shall  now  discuss  why  the  assumption  of  Kadane  simplifies  the  problem.  p(k)q(J,k ) 
is  the  probability  that  the  search  (J,k),  i.e.,  the  >th  search  of  box  k.  is  successful  while  the  cost 
of  this  search  amounts  to  c(j,k).  The  quotient  of  the  profit  and  the  cost,  e(j,k)  is  the 
efficiency  of  this  search.  Now  it  seems  to  be  reasonable  that  we  try  to  perform  the  most 
efficient  searches.  If  the  assumption  of  Kadane  is  not  fulfilled  the  most  efficient  search  may  be 
the  seventh  search  of  box  k.  But  we  cannot  perform  this  search  without  realizing  the  first  six 
searches  of  box  k  which  may  be  very  inefficient. 

2.  THE  DUAL  SEARCH  PROBLEM 

First  we  introduce  some  definitions  which  will  simplify  the  notation.  p(j,k) 
p(k)q(j,k)  is  the  probability  of  success  of  the  search  (J,k).  p*(J,k)  £  p(i,k)  is  the 

1 4 1> < ; 

probability  of  success  and  c*(j,k)  £  cii.k)  is  the  cost  of  the  first  j  searches  of  box  k. 

i  <  <<j 

For  each  allocation  a  we  conclude  P(a)  -  £  p*la(k),  k)  and  C(a)  -  ]£  c*(a(k),  k). 

kt B  *  €  B 

For  each  box  k  let  /*  be  the  following  step  map  on  [0,c*(°°,fr)].  For 
jr  6  [c*(y  -  1.  k),  c*(j,k ))  let  /*Or)  p*(j  -  1,  k)  and  fk(c*(°o,k ))  :«  lim  /*(*). 

X  —  c‘(oo,k) 
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Then  fk{c*(J,k ))  -  p*(J,k )  and  Pia)  -  £  fkic*iaik),  k)).  Lei  gk  be  the  continuous  func- 

k  €  B 

tion  which  combines  linearly  the  points  (c*(j.k),  p*(J,k))  and  Jet  hk  be  the  smallest  concave 
function  nowhere  smaller  than  J\.  Finally,  let  Bik)  :=»  [j  6  N0 1 /*(c*0'.^))  ”  hk(c*(J,k))}. 
Figure  1  illustrates  these  definitions. 


Obviously,  0,  <»  €  Bik).  The  slope  of  gk  in  an  interval  (c*(J  —  1,  k).  c*{j,k ))  is  equal 
to  the  efficiency  of  the  search  (j.k).  If  and  only  if _the  assumption  of  Kadane  is  fulfilled  gk  is 
concave.  This  is  equivalent  to  gk  -  hk  and  Bik)  -  M0. 

In  (61  it  turned  out  that  the  searches  J  +  1,  ....  J'  of  k  "belong  together"  if  J'  is  the 
direct  successor  of  J  in  Bik).  In  that  paper  the  dual  search  problem  to  a  search  problem  is 
defined  by  combining  the  searches  which  belong  together.  The  dual  problem  is  useful  for  this 
paper  too. 
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DEFINITION  1 :  The  dual  search  problem  of  a  search  problem  is  defined  by  the  parame¬ 
ters  ri,  p',  c\  q'  .  n':=  n,  p'  :=  p.  If  B(k)  has  infinitely  many  elements,  we  enumerate 
B(k)  -  (°°)  in  the  following  way:  0  =:  B0k  <  Blk  <  _  Let  c'(j,k )  :=  £  c(i.k) 

Bi-k 

and  q'ij.k)  £  q(i,k).  If  B(k )  has  finitely  many  elements  we  enumerate  B(k) 

*,-u 

by  0—:  B0k  <  ...  <  Bm{k)k  :=  °o.  For  j  <  m(k)  we  define  c'ij.k )  and  q'(j,k)  as  before, 
and  for  j  >  m(k)  we  define  additional  searches  by  c'ij.k ):  =  1  and  q'ij.k):  =  0. 

Looking  at  Figure  1  it  is  easy  to  see  that  g‘k  -  hk.  Therefore,  g'k  is  concave  and  the 
assumption  of  Kadane  is  always  fulfilled  for  the  dual  search  problem. 

Before  we  try  to  compute  an  /.-optimal  allocation  we  prove  its  existence. 

3.  THE  EXISTENCE  OF  /.-OPTIMAL  ALLOCATIONS 

Since  the  set  of  /.-admissible  allocations  is  never  empty  (let  aik)  =  0  for  all  k)  and  since 
the  probability  of  success  of  an  allocation  never  exceeds  1  Psup  :=  supl/Mallo  /.-admissible)  is 
always  well  defined.  There  exists  always  an  /.-admissible  allocation  a,°  so  that 
/>(a,°)  ^  Psup  -  2-'.  We  note  that  we  don't  know  how  to  compute  a,0. 

In  order  to  prove  the  existence  of  an  /.-optimal  allocation  we  define  a  subsequence  a"  of 
a,°  with  some  additional  properties.  If  a*_,( I  <  k  <  n)  is  already  defined,  we  define  a*  as  a 
subsequence  of  a,*-1  in  the  following  way.  If  a  * - 1  ( Ar )  is  bounded  we  choose  a*  so  that  a,*(fc)  is 
constant  for  i  €  N.  Otherwise,  we  choose  a*  so  that  a,M/c)  >  /  for  /  6  IN.  By  these 
definitions  a"  is  /.-admissible  and  P(a ,")  ^  Psu p  -  2~'.  If  a"(k)  (/  6  IN)  is  a  constant  sequence 
let  a(k)  be  equal  to  this  constant.  The  set  of  these  boxes  is  called  B.  Otherwise,  a, "(A)  >  /. 
Let  a(k )  °°  and  B"  :=  B  —  B'. 

Because  P(a)  ^  P(a")  >  Psup  -  2~‘  for  all  /  we  conclude  that  P(a)  >  Psu p.  In  order  to 
prove  that  a  is  /.-optimal,  it  is  sufficient  to  prove  that  a  is  /.-admissible. 

C(a)  —  £  c*(a(k),  k)  +  c*i°°.k) 

k(B  kiB 

—  lim  (  Y  c*(a(A),  k)  +  T  c*(i,k))  <  lim  C(a")  <  L. 

Now  we  have  proved 

THEOREM  1:  The  set  of  /.-optimal  allocations  is  not  empty.  Pmax  :=  max  {P(a)\a  L- 
admissible)  is  well  defined. 

4.  BOUNDS  ON 

First  we  give  a  lower  bound  on  Pn,,,  by  selecting  an  /.-admissible  allocation  which  seems 
to  be  rather  good.  Since  for  the  dual  search  problem  the  assumption  of  Kadane  that  e'(j,k)  is 
for  each  k  decreasing  is  fulfilled,  we  may  fix  a  complete  order  on  the  set  of  all  searches  in  such 
a  way  that  (J.k)  precedes  ij'.k’)  if  e'(j.k)  >  e'ij'.kj  and  that  (j  —  1 ,  k)  always  precedes  (j,k). 
We  note  that  in  this  order  each  search  has  a  well-defined  direct  successor  (but  not  always  a 
direct  predecessor). 
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By  a'(j,k)  we  denote  that  allocation  which  prescribes  exactly  all  searches  preceding  (J,k) 
(but  not  (J,k)).  (J,k )  is  called  the  crucial  search  of  a'ij.k).  It  is  easy  to  prove  (see  (31  or  (5J) 
that  a'ij.k)  is  C'(a'0‘,Ac))-optimal.  But  we  cannot  choose  how  much  we  will  offer.  Our  funds 
are  limited  by  L.  If  the  cost  of  all  searches  does  not  exceed  L  let  a'  =  °°  be  our  L-admissible 
allocation  which  of  course  is  L-optimal.  Otherwise,  there  is  exactly  one  search  (j,k)  so  that 
a'  —  a'(J,k )  is  L-admissible  but  a'ij'.k')  where  ij'.kj  is  the  direct  successor  of  (J,k)  is  not  In¬ 
admissible.  a' seems  to  be  a  good  /.-admissible  allocation  for  the  dual  problem. 

In  order  to  imitate  a'  by  an  L-admissible  allocation  we  use  the  enumeration  of  Bik)  or 
B(k)  —  (ooj  of  Definition  I.  If  a'ik)  - .«>  or  if  a'ik)  >  mik)  (if  Bik)  has  finitely  many  ele¬ 
ments)  let  aik)  Otherwise,  a  (Ac)  We  have  decomposed  a  search  of  the  dual 

problem  to  its  components  from  which  it  has  been  combined.  We  only  omit  the  additional 
searches.  Therefore,  C(a)  <  C'iaj  <  L.  We  call  a  the  good  allocation  for  our  search  prob¬ 
lem.  If  a'  =  a'ij.k )  and  j  <  m(Ac)  call  the  last  of  the  components  of  the  crucial  search 
( j.k )  for  the  dual  problem  the  crucial  search  of  a.  That  means  ( Bjk,k )  is  the  crucial  search  of 
a  is  a'=  a'ij.k)  and  j  <  m(k). 

PROPOSITION  1:  The  good  allocation  a  is  for  each  search  problem  L-admissible  and 
Pia)  <  Fmax.  We  like  to  mention  that  Kadane  has  already  used  the  allocation  a'.  For  the 
computation  of  a  good  upper  bound  on  Pmax,  we  introduce  a  new  concept. 

DEFINITION  2:  A  randomized  allocation  is  a  pair  (a, it)  where  a  is  an  allocation  and 
it  :  B —*  [0, 1]  so  that  nik)  =  0  if  a  (Ac)  -  ia.it)  prescribes  the  following  procedure: 
search  box  Ac  for  a  a  (Ac)  times  and  then  perform  the  next  search  of  box  k  with  probability 
7r(Ac).  The  expected  cost  of  ia.ir)  is  Cia.it )  :«  £  (c*(a(A:),  k)  +  itik)  ciaik)  +  1,  k)) 

ktB 

and  its  expected  probability  of  success  comes  to  Pia.it)  £  ip'iaik),  k)  + 

kiB 

ir(k)piatk)  +  1.  Ac)). 

If  it  =  0  we  obtain  a  standard  allocation.  Let  pia, it, k)  p*iaik),  Ac)  + 
ir(k)piaik)  +  1,  Ac)  and  cia.it.k)  c*iaik),  Ac)  +  irik)ciaik)  +  1,  Ac).  Then  we  con¬ 
clude  by  the  definition  that  pia.it ,k)  **  gkic ia.it ,k))  (see  Figure  1)  and  Pia.it)  — 
£  gkicia,ir ,k)). 
kiB 


If  the  assumption  of  Kadane  is  fulfilled  it  is  easy  to  define  an  L-optimal  randomized  allo¬ 
cation  ia.it).  Let  a  be  the  good  allocation.  If  a  =  «>  let  it  3  0  and  ia.it)  is  obviously  L- 
optimal.  Otherwise,  a  =  aij.k).  Let  7r(Ac')  -  0  for  Ac'  ^  Ac  and  irik):  —  (L  —  Cia))cij,k)~\ 
Obviously,  Cia.tr)  -  L.  That  ia.it)  is  L-optimal  may  be  proved  by  an  application  of  the  gen¬ 
eralized  Neyman-Pearson-lemma  (see  Kadane  [3]):  ia.it)  prescribes  all  searches  whose 
efficiency  is  larger  than  eij.k),  no  search  whose  efficiency  is  smaller  than  eij.k)  and  as  many 
of  the  searches  with  efficiency  eij.k)  that  Cia.it)  -  L. 

PROPOSITION  2:  Let  ia\  itj  be  an  L-optimal  allocation  for  the  dual  search  problem. 
Then  Pm„  <  P'ia\  it1). 

PROOF:  Since  the  assumption  of  Kadane  is  fulfilled  for  the  dual  problem,  we  can  com¬ 
pute  ia’,  itj  by  the  procedure  above. 


t 
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Let  a  be  an  L-optimal  allocation  for  the  original  problem  which  exists  by  Theorem  1. 
Then  Pmax  -  Pia)  -  £  fkic*iaik),  k ))  <  £  hkic*iaik),  k))  -  £  g'kic*iaik),  k )). 

kiB  kiB  k(B 

The  last  equality  follows  from  the  fact  that  the  function  hk  for  the  original  problem  equals  the 
function  g'k  for  the  dual  problem  (see  Section  2). 

Now  we  imitate  a  by  an  L-admissibie  randomized  allocation  ia,  n)  for  the  dual  problem. 
We  use  the  enumeration  of  Bik)  or  Bik)  -  {«)  of  Definition  1.  If  aik)  -  °°  let  aik)  «> 
if  Bik )  has  infinitely  many  elements  and  aik )  mik)  otherwise.  ifik)  0  in  both  cases. 
Otherwise,  there  exists  l(k)  so  that  Bak)k  <  a(k)  <  B,,k)  +  ]  k.  Let  aik)  l(k)  and 
i r(Ac)  (c*iaik),  k)  -  c*'iaik )  k))  c'iaik)  +1,  k)~\  It  follows  c'ia,  n,  k)  — 
c*‘iaik),  k)  +  nik)  c'iaik)  +  1,  k)  =  c*iaik),  k)  and  C'ia,  n)  =  Cia)  <  L.  Thus, 
(a,  n)  is  L-admissible.  Finally,  we  combine  our  results 

P max  <  1  g'k(c*(a(k),  k))  -  T  g'k  ( c'(a ,  n,  k))  -  P'ia,  n) 

kiB  ktB 

<  P'{a',  it').  Q.E.D. 


5.  THE  DECOMPOSITION  OF  A  SEARCH  PROBLEM 

We  are  now  able  to  compute  a  lower  and  an  upper  bound  on  PmiX.  If  these  bounds  coin¬ 
cide  the  good  allocation  a  of  proposition  1  is  L-optimal.  Otherwise  we  don’t  know  whether  a  is 
L-optimal.  Analogously  to  [31  we  divide  the  problem  to  two  subproblems.  Let  AiL)  be  the 
set  of  all  L-admissible  allocations, 

A'iL)  :=»  {a  €  A(L)\a(k)  <  j )  and  A"(L)  [a  €  /I(L)|a(Ic)  >  j\. 

The  computation  of  an  optimal  allocation  fo"  the  first  (second)  subproblem  is  equivalent  to  the 
computation  of  a  best  of  all  allocations  of  A'iL)  iA"iL)).  We  "exclude"  ("include")  the  search 
ij,  k). 

For  both  problems  we  compute  lower  and  upper  bounds.  If  for  the  undivided  problem 
with  the  largest  upper  bound  the  lower  and  the  upper  bound  coincide,  we  computed  an  L- 
optimal  allocation.  Otherwise,  we  divide  this  problem  and  so  on.  Since  we  don’t  suppose  that 
the  assumption  of  Kadane  is  fulfilled,  we  have  to  choose  more  carefully  than  Kadane  the  search 
for  which  we  divide  the  problem.  We  have  to  ensure  that  the  algorithm  stops  and  we  want  to 
get  an  algorithm  which  is  as  fast  as  possible.  At  first  we  show  how  we  divide  a  problem. 

(i)  The  exclusion  of  the  search  ij,  k). 

Let  S'  -  in',  p',  c‘,  q',  L')  be  the  following  search  probtem:  n'  -  n,  p'  -  p ,  L'  -  L,  the 
searches  ij',  k')  where  k'  ^  k  or  k’  -  k  and  /  <  j  have  the  same  cost  and  the  same  probabil¬ 
ity  of  success  as  before  and  for  /  >  j  let  c’if,  k)  -  L  +  1  and  q'ij',  k)  -  0.  Obviously,  an 
allocation  a  is  L '-admissible  for  S' iff  it  is  an  allocation  of  A’iL)  for  the  original  problem  S.  In 
that  case  the  probability  of  success  of  a  is  the  same  whether  we  use  a  for  5  or  for  S'.  Thereu¬ 
pon,  a  is  L’-optimal  for  S' iff  it  is  the  best  of  all  allocations  of  A'iL)  for  S. 

(ii)  The  inclusion  of  the  search  ij,  k). 

Let  S"  -  in",  p",  c",  q",  L")  be  the  following  search  problem:  n"  -  n,  p"  -  p, 
L"  —  L  -  c*ij,  k),  the  searches  (/,  k')  where  k'  *  k  have  the  same  cost  and  the  same  proba¬ 
bility  of  success  as  before  and  for  /  €  N  let  c"(i,  k)  -  cii  +  j,  k)  and  q"ii,  k)  -  qii  +  j,  k). 
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L"  <  0  iff  L  <  c"  0,  k),  i.e.,  iff  the  cost  of  the  included  searches  is  larger  than  our  funds. 
Therefore,  L"  <  0  iff  A"(L)  -  Q.  For  an  I  "-admissible  allocation  a"  for  S",  we  define  an  allo¬ 
cation  a  €  A"(L)  for  S  by  a(k')  a"ik')  for  k'  ^  k  and  a(k)  a"(k )  +  j.  By  this 
definition  C(a)  -  C"(a")  —  c*(J,  k)  and  a  e  A"(L)  for  S  iff  a"  is  /."-admissible  for  S". 
Therefore,  we  have  a  one-to-one  correspondence  between  A"(L)  for  S  and  the  set  of  in¬ 
admissible  allocations  for  S".  Since  P(a)  -  P"(a")  -  p*(J,  k)  a  is  the  best  of  all  allocations  of 
A"(L)  for  Siflf  a" is  /"-optimal  for  S". 

We  show  how  we  obtain  an  order  on  the  searches  of  the  dual  problem  of  S".  For  the 
included  search  (j,  k),  we  assume  j  €  B(k).  Therefore,  j  -  Btk  for  some  /  (see  Definition 
I).  If  we  erase  the  first  /searches  of  k  in  the  order  of  the  searches  of  the  dual  problem  of  S 
and  if  we  replace  the  search  (/,  k)  for  i  >  I  by  the  search  (/'  -  /,  k),  we  obtain  an  order  on  the 
searches  of  the  dual  problem  of  S". 

6.  AN  ALGORITHM  FOR  THE  CONSTRUCTION  OF  AN  /-OPTIMAL  ALLOCATION 

We  have  the  task  to  compute  an  Z0-optimal  allocation  for  the  search  problem  So  “  (n0, 
Po .  c0,  qq,  L0),  i.e.,  an  allocation  of  A0(L0 )  with  the  largest  possible  probability  of  success.  In 
the  course  of  our  algorithm  we  partition  /t0(/0)  to  disjoint  subsets  /L(Z0)  by  the  inclusion 
and/or  exclusion  of  some  searches  which  will  be  determined  later.  Therefore,  (see  Section  5) 
the  computation  of  a  best  of  all  allocations  of  A,iLo)  is  equivalent  to  the  solution  of  a  search 
problem  S,  -  («,.  p„  cl$  q,,  /,). 

We  are  able  to  compute  functions  u,  :  B  -*  1N0  so  that  A,(L0)  -  {a  €  /t0(/0)|/,(k) 
^  a(k)  <  u,{k)\.  Let  /0  s  0,  u0  S  <».  If  S,  is  generated  from  S,  by  the  inclusion  (exclu¬ 
sion)  of  the  search  O',  k),  we  conclude  that  u,  £  u„  l,(k')  -  l,(k')  for  k' s*  k  and 
/,(*)  -  l,(k)  +  j,  resp.  /j  =  /„  u,(k')  -  «,(/')  for  k'  ^  kand  qik)  —  /,(/)  +  j  -  1. 

By  l,  we  denote  the  set  of  the  indices  of  undivided  search  problems.  For  S,  we  compute 
the  good  /, -admissible  allocation  a*  of  Proposition  1.  By  the  results  of  the  previous  section 
a,  a *  +  /,  is  an  /0-admissible  allocation  for  S0  and  Pq(o,)  -  Pj(a*)  +  Po(i,)  is  a  lower 
bound  on  Furthermore,  a,  is  the  best  of  all  allocations  of  A,(L0)  iff  a *  is  /, -optimal  for 
5,.  Then  we  compute  by  Proposition  2  an  upper  bound  U*  on  P'mtx,  the  probability  of  success 
of  an  //-optimal  allocation  for  Sr  Again  by  the  results  of  the  previous  section  we  can  conclude 
that  Ui  U*i  +  PqU,)  >  max  (P0(o)|o  6  A,(L0))-  Since  the  sets  Am{L0)  (mtl,)  form  a  par¬ 
tition  of  A0(L0)  max  {Um\m  €  /,}  is  an  upper  bound  on  P^x- 

Finally,  we  have  to  discuss  how  we  divide  the  search  problem  S,  with  the  maximal  value 
of  Um(m  €  //).  If  Po(a,)  —  U,  the  allocation  o,  is  Z0-optimal  for  S0  since  always  P0(a,) 
^  P„,t  <  U,.  If  a*,  prescribes  for  S,  all  searches  with  positive  efficiency  Pt(a*,)  -  U*,  and 
therefore,  P0(,a,)  -  P,(a*t)  +  P0U,)  -  U*,  +  P0(l,)  -  U,. 

Otherwise,  we  have  defined  in  Section  4  a  crucial  search  (/„  k,)  for  a*,  where  j,  €  B,(k,). 
\f  j,  <  o°  we  divide  S,  by  the  inclusion  (exclusion)  of  this  crucial  search.  We  try  to  give  rea¬ 
sons  for  this  choice.  If  the  assumption  of  Kadane  is  fulfilled  and  if  we  realize  the  searches 
according  to  their  fixed  order,  the  crucial  search  is  the  first  search  which  we  are  not  able  to  real¬ 
ize  since  our  funds  are  exhausted.  This  justifies  the  name  crucial  search  In  order  to  obtain  a 
fast  algorithm  it  seems  to  be  reasonable  to  choose  this  search  for  the  decomposition  of  S,.  In 
general  case  we  imitate  the  above  procedure  by  imitating  the  above  allocation  for  the  dual  prob¬ 
lem.  Let  (/„  k\)  be  the  first  search  which  we  are  not  able  to  realize  since  our  funds  are 
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exhausted.  Then  k,  -  k',  and  j,  -  min  {/  €  M  |  i  >  /,,  /€  We  again  divide  the  prob¬ 

lem  by  the  search  (J,.  k,)  (and  not  by  (J'„  kj))  because  this  ensures  that  the  algorithm  stops. 

If  J,  “  00  it  makes  no  sense  to  exclude  (J,,  k,).  If  we  decide  to  divide  S,  by  (/,•  *»)  the 
algorithm  may  not  stop.  In  order  to  ensure  that  the  algorithm  stops  we  divide  S,  to  more  than 
two  subproblems.  In  doing  so  we  guarantee  that  for  the  search  problem  where  no  search  has 
been  excluded  the  funds  do  not  exceed  0.  Let  us  imitate  that  allocation  for  the  dual  problem 
which  prescribes  in  addition  to  the  searches  of  the  good  allocation  the  crucial  search.  The 
result  is  b,  where  b,(k)  -  a,(k)  for  k  ^  k,  and  b,(k,)  -  °°.  Since  C,(b,)  >  L,  we  may  choose 
an  allocation  b\  :  B  — •  M0  so  that  b',(.k)  <  b,(k)  and  C,(b',)  >  L,.  Let  {Ac1,  ....  fr'}:  — 
(it  €  B\b\(k)  >  0)  and  R  :-  2'.  We  divide  S,  to  R  subproblems  which  correspond  to  the  R 

different  sequences  "inclusion/exclusion  of  (b',(kl),  it1) .  inclusion/exclusion  of 

(b’,(kr),  it')."  Since  b',(k')  <  °°  these  inclusions  and  exclusions  are  well  defined.  Finally,  we 
use  the  convention  that  f/(:  -  -1  iff  L,  <  0,  i.e.,  iff  A^Lq)  -  0. 

For  the  search  problem  5,  the  computation  of  the  necessary  data  means  the  computation 
of  ah  P(.0i).  Uj  and  -  if  defined  -  (J„  k,)  if  L,  >  0  and  the  definition  U,  -1  if  L,  <  0. 

THE  ALGORITHM 


Input:  the  search  problem  S0  -  (n0.  Po •  Co-  <7o-  £<))• 

Step  1:  L0<  0  -=■>  A0(L0)  -9  —  STOP 

L0  >  0  =■»>  /  :=  0.  /0  (0),  compute  the  necessary  data  for  Sq  —  Step  2 

Step  2:  Choose  t  €  l,  so  that  (/,  -  max  (l/mlrn  €  /,}. 

P(a,)  —  U,  ■— >  a,  is  Lo'OPfintal  STOP 

P(a,)  *  U,,  j,  <  00  —  Step  3 

P(a,)  *  U„  j,  -  00  —  Step  4 

Step  3:  Divide  S,  by  the  inclusion/exclusion  of  (j,,  k,)  to  SJ+,  and 
S,+2-  Compute  the  necessary  data  for  Si+\  and  Si+2-  Increase  /  by  2. 

Set  !i  -  (/(_2U{/  -  1.  /})  -  {/}  —  Step  2 

Step  4:  Divide  S,  by  the  R  sequences  inclusion/exclusion  of  (i>',  (Ac1) . 

/c1) . inclusion/exclusion  of  (b',(kr),  kr)  to  S,+1,  ....  S,+K.  Com¬ 
pute  the  necessary  data  for  St+t . S/+K.  Increase  /  by  R.  Set 

A  “  (/,_rU(/  -  R  +  1 . /})  —  (rj  — ►  Step  2 


Combining  our  results  we  have  proved 

THEOREM  2:  If  L0  >  0  and  the  algorithm  stops  a,  is  an  L0-optimal  allocation. 

7.  THE  ANALYSIS  OF  THE  ALGORITHM 

THEOREM  3:  The  algorithm  stops  for  each  search  problem. 

PROOF:  We  define  for  each  search  problem  a  characteristic  ( V,  IF).  Fis  the  number  of 
boxes  for  which  infinitely  many  searches  have  positive  efficiency.  For  the  other  boxes  let  the 
IF(fc)-th  search  be  the  last  search  with  positive  efficiency.  IF  is  the  sum  of  these  W(k).  We 
prove  the  theorem  by  induction  on  the  lexicographical  order  on  ( V,  W)  is  smaller  than 
( V,  W)  iff  V  <  For  V  -  Fand  W'  <  W. 
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For  ( V,  IF)  -  (0,0)  all  searches  have  efficiency  0  and  the  algorithm  stops  by  the  first  pas¬ 
sage  of  Step  1  or  2.  Now  let  the  assertion  of  the  theorem  be  valid  for  all  ( V.  If")  smaller  than 

( v,  m. 

Let  5  be  a  search  problem  with  the  characteristic  (F,  IF).  If  the  algorithm  stops  by  the 
first  passage  of  Step  1  or  2  the  assertion  of  the  theorem  is  valid  for  S.  Otherwise,  S  becomes 
divided.  We  compute  an  optimal  allocation  for  S  while  we  compute  for  all  subproblems  an 
optimal  allocation  by  the  algorithm  and  while  we  compare  these  allocations.  This  new  pro¬ 
cedure  is  at  most  as  fast  as  the  algorithm.  Therefore,  it  is  sufficient  to  prove  the  assertion  for 
all  subproblems  S'. 

If  we  include  or  exclude  a  search  the  characteristic  ( V\  If")  of  the  new  problem  is  at  most 
as  large  as  the  characteristic  ( V,  W)  of  the  given  problem. 

CASE  1:  During  the  definition  of  S',  we  have  excluded  some  search. 

(i)  If  we  have  excluded  a  search  (j,k)  so  that  infinitely  many  searches  of  k  had  positive 
efficiency  V  <  V  and  the  algorithm  stops  for  S'  by  the  induction  hypothesis. 

(ii)  If  (i)  is  not  fulfilled  and  if  we  have  excluded  a  search  (J.k)  where  j  <  W(k),  we 
conclude  that  F'  -  V,  W'(k)  <  Wik)  and  W'(k')  <  W(k')  for  k'  *  k.  Therefore,  W'  <  W 
and  the  algorithm  stops  for  S' by  the  induction  hypothesis. 

(iii)  If  (i)  and  (ii)  are  not  fulfilled  we  have  excluded  a  search  (J.k)  where  j  >  W(k). 
This  is  impossible  by  the  definition  of  our  algorithm.  If  (J.k )  is  the  crucial  search  (Step  3)  the 
good  allocation  prescribes  all  searches  with  positive  efficiency  and  the  algorithm  would  have 
stopped  at  Step  2.  If  (J.k)  is  excluded  at  Step  4  j  <  I V(k)  by  the  definition  of  b'. 

CASE  2:  S'  has  been  defined  at  Step  4  and  we  have  not  excluded  some  search.  The  cost 
of  the  included  searches  is  larger  than  our  funds.  Therefore,  L'  <  0  and  the  algorithm  stops 
immediately. 

CASE  3:  S'  has  been  defined  at  Step  3  and  we  have  not  excluded  some  search.  If 
( F',  W)  is  smaller  than  ( F,  IF)  the  algorithm  stops  for  S'  by  the  induction  hypothesis.  Other¬ 
wise,  ( V,  If")  «■  ( F,  IF).  Either  the  algorithm  stops  at  Step  1  or  2  and  the  assertion  is  proved  or 
S'  becomes  divided.  We  again  argue  that  it  is  sufficient  to  prove  the  assertion  for  all  subprob¬ 
lems  S"  of  S'  and  again  we  obtain  at  most  one  crucial  problem.  We  repeat  this  argumentation 
for  n  +  1  times.  If  the  theorem  is  not  proved  we  always  obtain  a  new  crucial  problem  with  the 
characteristic  (F,  IF)  by  including  a  search  at  Step  3.  Well  call  these  problems 
So-’  —  S,  Sj:  ■»  S',  Sj . -S’d+t- 

S(+|  is  generated  from  S,  by  the  inclusion  of  (Jlt  kt)  at  Step  3  (0  <  i  <  n).  While  we 
show  that  this  case  never  happens  we  prove  the  theorem. 

Since  J,  t  B,(kt)  the  end  of  the  search  (J,.  k,)  coincides  with  the  end  of  a  search  O',  k,) 
for  the  dual  problem.  We  remember  that  we  obtain  the  order  on  the  searches  of  the  dual  prob¬ 
lem  of  S/+i  by  erasing  the  first  j',  searches  of  k,  in  the  order  of  the  searches  of  the  dual  prob¬ 
lem  of  S,.  Let  a)  be  the  allocation  for  the  problem  of  S,  which  prescribes  all  searches  preceding 

<J',.  kf)  and  (J),  kt ),  too. 
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We  know  by  the  definition  of  (/,,  kf  that  the  cost  of  a',  exceeds  L,.  By  the  definition  of 
the  inclusion  of  0, ,  k,)  the  cost  of  the  searches  which  have  preceded  (/"<.  *,)  for  the  dual  prob¬ 
lem  of  S,  and  which  we  have  not  erased  exceeds  Li+]  for  the  dual  problem  of  S/+).  Among 
these  searches  is  no  search  of  box  k ,.  By  the  definition  of  the  crucial  search  0',+i.  *,+ 1),  we 
conclude  that  a'I+t(*)  <  a',(k)  for  all  k  and  a'/+1(fc()-0  while  a',(k,)  -  j\  >  0.  Thus, 
Ar0 . are  distinct  which  obviously  is  impossible.  Q.E.D. 

We  finish  this  paper  with  some  comments  about  the  algorithm. 

(1)  If  the  assumption  of  Kadane  is  fulfilled  we  never  reach  Step  4  and  our  algorithm  coincides 
with  the  algorithm  of  Kadane. 

(2)  At  each  time  max  \P0(am)\m  €  /,)  ^  P°,x  <  max  [U„\m€  /,}.  In  order  to  obtain  only 
a  nearly  optimal  Z.0-admissibIe  allocation,  we  may  stop  the  algorithm  if  the  difference  of 
the  two  bounds  is  small  enough. 

(3)  Always  using  Step  4  instead  of  Step  3,  we  obtain  also  an  algorithm  which  always  stops  and 
which  always  computes  an  optimal  allocation.  The  proof  of  Theorem  3  would  be  much 
easier  since  Case  3  becomes  superfluous.  But  it  seems  to  be  obvious  that  our  algorithm  is 
much  faster  for  nearly  all  search  problems. 

(4)  Obviously  our  algorithm  is  not  very  fast.  It  is  not  even  polynomial ly  time  bounded.  Let 
us  consider  the  following  assumptions  for  our  search  problem:  c(2,k)  -  L  +  1, 
c(l .A)  €  M,  L  €  N  and  p(l,/c)  -  d  c(l ,A)  where  d  €  R+.  Then  we  may  restrict  our¬ 
selves  to  allocations  a:5  — 1 *  {0, 1}.  Such  an  allocation  corresponds  to  the  set 
A  .-  {/c|fl(A)  -  1}.  We  conclude  C(a )  -  JT  c ( 1 , A)  and  P(a)  -  £  p{\,k)  -  d  C(a). 

kfA  kkA 

Therefore,  a  is  L-optimal  iff  C(a)  is  maximal  under  the  restriction  that  C(a)  <  L. 
While  computing  an  L-optimal  allocation  we  decide  whether  there  exists  an  allocation  a 
where  C(a)  -  L.  This  problem  is  known  as  the  knapsack  problem.  It  is  well  known  that 
the  knapsack  problem  is  NP-complete  (Karp  [41).  If  the  conjecture  of  many  computer 
scientists  that  NP  ^  P  is  correct  there  is  not  fast  algorithm  for  the  knapsack  problem.  In 
particular,  there  is  no  fast  algorithm  for  our  search  problem.  A  more  detailed  discussion 
of  these  ideas  may  be  found  in  [1]. 
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ABSTRACT 

The  problem  of  assigning  computer  program  modules  to  functionally  similar 
processors  in  a  distributed  computer  network  is  investigated.  The  modules  of  a 
program  must  be  assigned  among  processors  in  such  a  way  as  to  minimize  in¬ 
terprocessor  communication  while  taking  advantage  of  affinities  of  certain 
modules  to  particular  processors.  This  problem  is  formulated  as  a  zero-one 
quadratic  programming  problem,  but  is  more  conveniently  modeled  as  a  direct¬ 
ed  acyclic  search  graph.  The  model  is  developed  and  a  backward  shortest  path 
labeling  algorithm  is  given  that  produces  an  assignment  of  program  modules  to 
processors.  A  non-backtracking  branch-and-bound  algorithm  is  described  that 
uses  a  local  neighborhood  search  at  each  stage  of  the  search  graph. 


1.  INTRODUCTION 

Developments  in  hardware  technology  and  the  trend  toward  large  shared  data  base  sys¬ 
tems  have  contributed  to  the  feasibility  and  popularity  of  distributed  computing  systems.  A  dis¬ 
tributed  computer  network  is  considered  to  be  a  set  of  programmable  processors  interconnected 
to  some  extent  by  communication  facilities  [14].  Cost-effective  methods  must  be  developed  for 
these  systems  to  control  the  allocation  of  computing  resources  among  the  jobs  introduced  into 
the  network. 

Scheduling  theory  [6]  is  a  subject  that  has  received  a  great  deal  of  attention  since  the 
development  of  digital  computer  systems.  The  general  scheduling  problem  is  that  of  allocating 
limited  resources  among  multiple  tasks  when  choices  exist  in  the  allocation  process.  The  poli¬ 
cies  governing  the  apportionment  of  the  resources  are  called  scheduling  rules  or  scheduling 
algorithms.  Scheduling  algorithms  for  computer  systems  are  used  to  assign  a  set  of  jobs  to 
resources  which  are  used  in  executing  or  servicing  the  jobs  [4,  5 ,  17]. 

The  nature  of  job  scheduling  [24]  depends  on  the  functional  similarity  of  the  processing 
nodes  and  on  the  degree  of  communication  available  between  processors.  If  the  network  con¬ 
sists  of  functionally  different  processors,  then  job  scheduling  is  simple  since  each  job  would  be 
designed  for,  and  therefore  assigned  to,  one  particular  specialized  processor. 
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In  a  network  of  functionally  similar  processing  nodes,  it  may  be  possible  to  assign  the 
parts  of  a  program  freely  among  the  processors;  but  in  a  practical  sense,  the  communication 
links  in  a  distributed  network  constitute  inherent  bottlenecks  and  therefore  constrain  the  assign¬ 
ment  of  computational  tasks.  When  high  penalties  are  imposed  for  communication,  the  practi¬ 
cal  solution  is  to  minimize  the  amount  of  communication  between  processors  by  assigning 
related  tasks  to  the  same  processor.  However,  if  the  processors  in  the  network  were  fully  con¬ 
nected  by  high  capacity  data  links,  many  feasible  alternative  assignments  of  computational  tasks 
to  processors  would  exist  and  should  be  evaluated  by  the  job  scheduler.  In  such  cases,  inter- 
processor  communication  would  no  longer  be  regarded  as  a  serious  constraint,  and  indeed  the 
overall  efficiency  of  the  system  would  be  improved  through  intelligent  scheduling  that  takes 
such  communication  into  account. 

Recent  technological  advances,  such  as  the  economical  fabrication  of  processors  and  the 
development  of  broadband  communication  facilities,  suggest  that  this  is  precisely  the  character 
of  distributed  processing  that  will  becorr?  prevalent  in  the  near  future  [31].  In  fact,  the  archi¬ 
tecture  of  the  Cm*  multimicroprocessor  system  [23]  provides  a  good  example  of  the  type  of  dis¬ 
tributed  computers  that  will  be  considered  in  this  paper;  and  the  problem  of  distributing 
software  utilities  among  Cm*  processors  is  similar  to  the  assignment  problem  studied  here. 

In  Section  2,  the  problem  is  stated  and  formulated  as  a  nonlinear  programming  problem. 
In  Section  3,  a  directed  acyclic  search  graph  is  described  and  a  shortest  path  algorithm  is 
developed  that  yields  an  assignment.  A  branch-and-bound  algorithm,  described  in  Section  4, 
may  be  applied  to  the  search  graph  to  produce  assignments  of  generally  inferior  quality  but  with 
considerably  less  computational  effort.  An  analysis  of  the  algorithms  and  their  performance  is 
given  in  the  last  section. 

2.  STATEMENT  OF  THE  PROBLEM 

The  distributed  system  under  discussion  is  assumed  to  be  a  network  of  processors,  fully- 
connected  by  identical  communication  links  (so  that  the  cost  of  sending  one  unit  of  data 
between  any  two  processors  is  the  same).  Although  the  processors  are  assumed  to  be  function¬ 
ally  similar,  they  need  not  be  identical.  In  fact,  certain  processors  may  have  particular 
efficiencies  for  executing  particular  program  modules.  For  example,  some  processors  may  have 
high-speed  arithmetic  capabilities,  access  to  a  needed  data  base,  a  large  high-speed  memory, 
availability  to  required  peripheral  devices,  or  other  facilities  associated  with  them  which  make 
them  especially  well-suited  for  executing  certain  program  modules. 

The  programs  being  executed  within  the  distributed  computer  system  are  assumed  to  be 
partitioned  into  functional  modules  (containing  executable  code  and/or  data)  which,  in  general, 
may  reside  on  any  processor  in  the  system.  There  is  no  parallelism  or  multitasking  of  module 
execution  within  a  program.  Each  processor  may  be  multiprogrammed,  and  divide  its  time 
among  several  programs,  but  concurrent  execution  of  the  modules  in  one  program  is  not  con¬ 
sidered. 

The  modules  of  a  modular  program  are  to  be  assigned  among  the  processors  in  such  a  way 
as  to  take  advantage  of  the  affinities  of  some  modules  to  certain  processors  while  minimizing 
the  costs  of  communication  between  jobs  that  are  assigned  to  different  processors.  Thus,  there 
are  two  kinds  of  costs  that  must  be  considered  in  the  search  for  an  assignment: 
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a.  Each  module  has  an  execution  cost  that  depends  on  the  processor  to  which  it  is 
assigned.  Let  eu  represent  the  cost  of  executing  module  i  on  processor  j. 

b.  Any  two  modules  that  communicate  during  program  execution  incur  a  penalty  if  they 
are  assigned  to  different  processors.  (It  is  assumed  that  the  cost  of  such  communication  is  zero 
when  the  reference  is  made  between  modules  residing  on  the  same  processor.)  L  he  cost  of 
communication  between  program  modules  /  and  k  be  denoted  by  crt.  The  costs  eu  and  c,k  must 
be  measured  in  the  same  units  of  money  or  time. 

The  distributed  programs  considered  here  are  serial  programs  for  which  execution  can 
shift  from  one  processor  to  another.  Therefore,  all  execution  costs  and  communication  costs 
are  incurred  in  disjoint  time  intervals,  and  if  these  costs  are  measured  in  time,  then  the  cost  of 
the  assignment  is  actually  the  completion  time  of  the  program. 

The  desired  assignment  is  one  which  minimizes  the  sum  of  the  execution  costs  of  the 
modules  on  the  processors  and  the  incurred  intermodule  reference  costs.  The  problem  can  be 
formulated  as  a  zero-one  quadratic  programming  problem.  The  execution  costs  are  expressed 
in  the  objective  function  simply  as 

m  it 

Jr  Jt  e°x<J' 

Communication  costs  incurred  by  modules  not  assigned  to  the  same  processor  are  of  the  form 

«—l  n  m 

21  E  E  c«  d  —  xkj 

i-T  7-1  *-<+l 

»i-l  *t  m  m-1  h  m 

“III  c*xkj  ~  2)  E  E  cikxuxKi 

Pi  P 1  *->+!  /il  i-\  *-(+! 

*-l  m  (  »  ]  m-1  x  m 

-  E  Z  c<*  2>*/  -  I  E  E  c*xu*v 

i- T  *-/+ 1  /  i-i  P i  *-/+) 

Since  the  summation  in  parentheses  is  one,  and  the  summation  of  the  cA  is  just  a  constant,  the 
first  group  of  communication  terms  can  be  ignored  in  the  objective  function.  Thus  the  formu¬ 
lation  of  the  problem  is  as  follows: 

minimize 

m  n  m—  1  *  m 

E  E  eVX‘j  "III  c!kxuxkj 

1-1  }=\  (-1  7-1  *-l+l 

subject  to  the  constraints 


(cl) 

xu  -  0. 1 

for  all  /,  j 

( xtJ  -  1  if  module  i  is  assigned  to 
processor  j,  and  otherwise  ■■  0) 

(c2) 

7-1 

for  all  i 

(each  module  is  to  be  assigned  to  exactly  one  processor) 

where  m  is  the  number  of  program  modules  and  n  is  the  number  of  processors. 
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General  zero-one  polynomial  programs  can  be  converted  to  linear  programs  with  non¬ 
linear  secondary  constraints  (12,  32),  however  the  problem  at  hand  has  been  approached  by 
using  techniques  which  take  advantage  of  the  special  structure  of  this  problem. 

The  problem  has  been  solved  for  n  -  2  by  Stone  [29].  A  model  is  developed  that  can  be 
interpreted  as  a  commodity  flow  network,  and  an  assignment  is  made  by  applying  a  maximum 
flow  algorithm  (11).  Efforts  to  extend  this  method  to  the  general  ^-processor  case  have  not 
been  completely  successful. 

For  n-processor  problems  in  which  the  intermodule  reference  pattern  is  constrained  to  be 
a  tree,  an  optimal  assignment  can  be  obtained  by  using  a  shortest  path  algorithm  [3].  The 
graph  model  developed  for  this  restricted  problem  is  extended  in  the  next  section  to  allow  an 
arbitrary  module  intercommunication  pattern. 

Assignment  algorithms  such  as  the  ones  to  be  described  here  may  be  used  to  find  a  static 
assignment  of  modules  to  processors,  but  may  also  be  applied  repeatedly  during  the  execution 
of  a  program  to  reassign  modules  dynamically  as  the  program’s  working  set  changes.  (The 
dynamic  reassignment  problem  has  been  investigated  by  Bokhari  [2].) 

3.  SHORTEST  PATH  ALGORITHM 

A  state-space  representation  of  the  problem  [16,  22]  is  given  by  a  graph  in  which  the 
nodes  represent  program  modules  assigned  to  processors,  and  the  arcs  connecting  the  nodes 
represent  communication  between  modules.  The  node  [(/']  represents  the  assignment  of 
module  i  to  processor  j.  For  a  program  of  m  modules  to  be  assigned  among  n  processors,  the 
graph  model  will  contain  mn  nodes.  Arcs  are  defined  and  labeled  as  follows: 

a.  The  arc  from  node  [//]  to  node  My]  is  labeled  with  the  value  e^,  the  execution  cost  of 
module  k  on  processor  j.  (Since  both  modules  /'  and  k  reside  at  processor  j,  there  is  no  com¬ 
munication  cost  to  consider.) 

b.  The  arc  from  node  [y]  to  node  Ipq)  is  labeled  with  the  value  (eM  +  c^),  the  sum  of 
the  execution  cost  of  module  p  on  processor  q  and  the  cost  of  communication  between  modules 
i  and  p. 

c.  The  arc  from  node  [yl  to  node  [ik]  does  not  exist,  since  a  module  is  assumed  not  to 
communicate  with  itself. 

Directions  are  imposed  arbitrarily  on  the  arcs,  in  such  a  way  that  the  graph  is  acyclic. 
This  does  not  imply  a  precedence  ordering  among  the  program  modules,  but  rather  simply 
establishes  a  direction  in  which  a  search  procedure  is  to  advance  through  the  graph  model.  The 
acyclic  property  of  the  graph  is  easily  maintained  as  the  graph  model  is  built,  by  appropriately 
sorting  the  input  data.  The  data  is  arranged  so  that  arcs  are  always  directed  from  an  existing 
node  to  a  node  not  yet  in  the  graph.  This  insures  that  all  arcs  are  oriented  in  a  "forward*  direc¬ 
tion,  and  never  point  "back"  in  a  manner  that  would  generate  loops  in  the  graph  model. 

The  set  of  nodes  representing  the  possible  assignments  of  a  particular  module  to  all  pro¬ 
cessors  is  referred  to  as  a  stage ,  and  the  nodes  within  a  stage  are  the  possible  states  of  that 
stage. 
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There  is  one  stage  (the  first  stage  in  the  graph)  which  has  no  predecessor  stage.  A  single 
artificial  initial  node  s  is  added  as  a  predecessor  to  this  stage.  The  arcs  from  this  initial  node  to 
the  nodes  [ij\  in  the  first  stage  are  labeled  with  the  cost  of  executing  module  /on  processor  j. 

Several  stages  may  have  no  successor  stage,  therefore  a  final  node  t  is  added  to  the  graph 
model,  to  serve  as  a  single  termination  point.  Arcs  from  nodes  in  the  terminal  stages  to  this 
final  node  are  all  labeled  with  the  value  zero. 

The  model  so  constructed  is  called  a  search  graph.  The  algorithm  to  be  described  evalu¬ 
ates  the  states  in  every  stage  of  the  search  graph  with  the  goal  of  selecting  the  best  state  for 
each  stage  (that  is,  making  the  optimal  assignment  of  each  module  to  a  processor). 

Figure  1  depicts  a  program  of  m  -  5  modules  which  are  to  be  distributed  among  n  -  3 
processors.  The  program  graph  consists  of  nodes  representing  modules  and  arcs  representing 
the  pattern  of  intermodule  references.  Arc  labels  indicate  the  intermodule  reference  cost 
incurred  when  the  two  modules  run  on  different  processors.  These  costs  are  shown  in  the  com¬ 
munication  cost  matrix.  The  execution  costs  of  the  five  modules  on  the  three  processors  are 
given  in  tabular  form  in  the  execution  cost  matrix.  Finally,  the  directed  acyclic  graph  shows  the 
directions  arbitrarily  imposed  on  the  arcs  in  preparation  for  the  construction  of  the  search  graph. 
The  complete  search  graph  is  illustrated  in  Figure  2,  where  the  large  circles  represent  the  three 
possible  ways  of  assigning  each  of  the  five  modules,  and  the  arcs  are  labeled  according  to  the 
rules  specified  at  the  beginning  of  this  section. 

A  stage  with  multiple  successor  stages,  such  as  that  represented  by  module  A  in  the 
figure,  is  called  a  "fork"  stage.  A  "join"  stage  is  one  with  multiple  predecessor  stages,  such  as 
module  D  in  the  figure. 

The  search  algorithm  is  a  node-labeling  technique  consisting  of  two  phases,  a  labeling 
phase  and  an  assignment  phase: 

Labeling  Phase 

The  procedure  begins  with  the  terminal  node  /  and  advances  toward  the  initial  node  s,  and 
labels  all  of  the  nodes  according  to  the  following  rule. 

Labeling  Rule: 

CASE  1:  If  the  node  has  only  one  stage  as  successors,  then  label  the  node  with  the 
minimum  (over  all  successor  states)  of  the  sum  of  the  label  on  the  succes¬ 
sor  node  and  the  weight  on  the  arc  between  the  current  node  and  the  suc¬ 
cessor  node. 

CASE  2:  If  the  node  has  more  than  one  stage  as  successors,  then  compute  a  partial 
label  (using  the  rule  for  Case  1)  based  on  each  successor  stage,  and  add 
these  values  to  obtain  the  label  for  the  current  node. 

The  label  finally  obtained  for  the  initial  node  s  is  an  upper  bound  on  the  cost  of  the 
assignment  produced  by  the  algorithm.  (The  relation  between  this  upper  bound  and  the  actual 
cost  will  be  discussed  later  in  this  section.) 
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Assignment  Phase 

Once  the  labeling  process  is  complete,  the  actual  assignment  is  obtained  by  starting  with 
the  initial  node  s,  advancing  through  the  search  graph,  and  selecting  at  each  stage  the  successor 
node,  or  nodes,  used  in  the  Labeling  Phase  to  generate  the  label  for  the  current  node.  The  fol¬ 
lowing  rules  apply  to  each  node  already  "marked,"  that  is,  already  selected  to  be  in  the  assign¬ 
ment.  (The  initial  node  s  is  considered  to  be  trivially  a  part  of  the  assignment,  so  it  is  marked 
first.) 

Assignment  Rule: 

CASE  1:  If  the  node  has  only  one  successor  stage,  mark  the  successor  node  selected 
in  generating  the  label  for  the  current  node. 

CASE  2:  If  the  node  has  more  than  one  successor  stage,  mark  the  successor  node  in 
each  successor  stage  that  was  selected  to  contribute  to  the  label  on  the 
current  node. 

The  assignment  is  complete  when  the  terminal  node  /  is  marked.  The  subgraph  of  the 
original  g.aph  model,  which  consists  of  all  the  marked  nodes  and  the  arcs  connecting  them, 
constitutes  the  assignment  graph.  In  Figure  3,  the  assignment  graph  is  shown  in  bold  lines  and 
the  node  labels  are  written  inside  the  circles  denoting  nodes. 

The  optimality  of  the  assignment  resulting  from  this  search  procedure  is  dependent  upon 
the  structure  of  the  search  graph,  but  is  perhaps  best  analyzed  as  follows. 

Node  labeling  is  begun  at  the  terminal  node  and  advances  backward  until  the  initial  node 
is  labeled;  thus  the  procedure  is  known  as  backward  analysis  or  backward  recursion.  The  under¬ 
lying  dynamic  programming  concept  is  known  as  the  dynamic  programming  principle  of 
optimality  [1,  9,  15]:  that  the  optimal  policy  for  the  remaining  stages  (from  the  current  node  to 
the  terminal  node)  is  independent  of  the  policy  adopted  in  previous  stages  (from  the  initial 
node  to  the  current  node).  At  each  stage,  the  label  for  each  state  denotes  the  minimum  cost  of 
any  sequence  of  decisions  that  can  be  made  from  the  current  state  to  the  terminal  node.  The 
algorithm  therefore  produces  an  assignment  at  each  stage  by  evaluating  all  possible  states  in 
that  stage  and  selecting  the  most  attractive. 

When  a  node  n ,  has  successors  ny,  for  j  —  1 ,  . . .  ,  n,  which  all  belong  to  one  stage,  then 
this  represents  a  communication  between  just  two  program  modules.  It  suffices  to  label  node  n, 
with  the  minimum  of  the  sum  of  the  weight  on  the  arc  to  ny  and  the  label  on  node  ny,  since 
this  constitutes  the  optimal  decision  at  node  n,.  The  weights  on  the  arcs  are  such  that  the  inter¬ 
communication  costs  are  included  between  modules  on  different  processors,  and  not  when  two 
communicating  modules  reside  on  the  same  processor.  Thus  the  label  on  node  n ,  is  the  optimal 
cost  of  communicating  with  and  executing  the  module  represented  by  ny  plus  the  cost  of  com¬ 
pleting  all  remaining  modules.  This  is  Case  1  of  the  Labeling  Rule. 

When  node  n ,  has  successors  n}  belonging  to  more  than  one  stage  (i.e.,  when  node  n, 
represents  a  "fork"  stage),  then  the  module  represented  by  n,  is  communicating  with  more  than 
one  other  higher-numbered  module,  and  all  of  these  costs  must  be  considered.  Case  2  of  the 
Labeling  Rule  sums  the  minimized  costs  for  all  successor  stages  and  labels  node  nt  with  the 
result. 
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If  the  module  intercommunication  pattern  is  a  tree  rooted  at  the  initial  node  s,  then  an 
optimal  assignment  always  results  from  the  algorithm.  The  label  on  node  s  is  equal  to  the  cost 
of  the  optimal  assignment,  and  the  assignment  itself  is  obtained  during  the  Assignment  Phase 
of  the  algorithm. 

Because  the  search  graph  is  not  generally  a  tree,  and  more  than  one  stage,  say  B  and  C, 
may  share  the  same  successor  stage,  say  £>,  the  cost  from  stage  D  to  the  terminal  node  will  be 
included  in  the  labels  for  both  stages  B  and  C.  As  the  procedure  advances  upward  from  B  and 
C  to  a  common  predecessor  stage,  say  A,  the  labels  assigned  to  nodes  in  stage  A  will  be  com¬ 
puted  in  such  a  way  that  the  costs  from  successor  stage  D  downward  are  counted  twice.  Thus, 
in  this  case,  the  label  on  a  node  actually  represents  an  upper  bound  on  the  cost  from  that  node 
to  the  end  of  the  search  graph.  (It  should  be  noted  that  the  occurrence  of  an  inflated  label  at 
any  node  of  the  search  graph  does  not  prevent  the  algorithm  from  yielding  a  solution  that  is 
guaranteed  to  be  optimal.) 

A  couple  of  practical  considerations  are  addressed  at  this  point.  First,  in  a  general  search 
graph,  "ambiguous”  assignments  may  arise;  a  method  will  be  given  here  for  resolving  such 
ambiguities.  This  is  followed  by  a  discussion  of  the  second  problem:  that  encountered  in  cyclic 
search  graphs.  A  technique  will  be  described  which  guarantees  the  construction  of  acyclic 
models. 

In  the  problem  illustrated  in  Figure  3,  module  D  is  assigned  to  processor  1  because  the 
node  [DIJ  is  the  optimal  successor  to  both  node  [All  and  node  [Cl]  and  thus  no  conflict 
occurs.  In  contrast,  an  ambiguous  situation  is  illustrated  by  the  assignment  graph  in  Figure  4, 
in  which  the  optimal  successor  to  node  [£i]  is  [Z)l],  but  the  optimal  successor  to  node  [Cl]  is 
[D2].  Such  an  ambiguity  must  be  resolved  in  order  to  obtain  a  valid  assignment,  satisfying 
constraint  (c2). 

Let  X  and  Y  be  predecessor  stages  of  stage  Z.  Suppose  that  [2T/1  and  [  Yj]  are  optimal 
assignments,  and  that  [Zk]  is  the  optimal  successor  to  [ AT/' J ,  and  [Z/]  is  the  optimal  successor, 
to  [Yj].  Let  k(A/)  denote  the  label  on  the  node  [AI\.  Then  ambiguities  are  easily  resolved  by 
selecting  the  state  at  stage  Z  corresponding  to  the 

mm  [c^k  F  Cxz  F  Ckz  F  X(ZJc),  ezi  F  c%z  F  Cyz  F  X(Z/)}. 

In  general,  a  stage  Z  may  have  many  predecessor  stages,  each  suggesting  a  different  assignment 
at  stage  Z.  In  that  case,  each  possibility  must  be  evaluated  by  taking  the  sum  of: 

1.  the  execution  cost  of  the  module  associated  with  the  current  state, 

2.  the  sum  of  the  communication  costs  between  the  current  state  and  all  its  assigned 
predecessor  states,  and 

3.  the  label  on  the  current  state. 

The  least  costly  alternative  is  then  selected. 

It  happens  that,  for  the  graph  in  Figure  4,  an  optimal  assignment  results  from  applying 
the  above  rules  for  resolving  ambiguities.  However,  in  general,  when  ambiguities  arise,  the 
optimality  of  the  assignment  is  not  guaranteed.  If  D  is  any  "join"  stage  (such  as  in  Figure  4) 
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and  A  is  any  preceding  "fork”  stage,  then  the  label  on  a  node  in  stage  A  may  contain  incompati¬ 
ble  constituents.  In  the  cited  example,  the  left  constituent  is  based  on  state  1  in  stage  D ,  while 
the  right  constituent  is  based  on  state  2  in  stage  D.  Clearly,  constraint  (c2)  does  not  allow  both 
states  in  the  assignment,  therefore  the  labels  on  nodes  in  stage  A  are  not  necessarily  realistic. 

Note  that  the  presence  of  a  "fork"  and  "join"  substructure  within  the  search  graph  does  not 
necessarily  give  rise  to  ambiguities  and  therefore  need  not  compromise  the  optimality  of  the 
assignment. 

The  effect  of  the  structural  properties  of  the  search  graph  on  the  optimality  of  the  solution 
produced  by  the  shortest  path  algorithm  described  in  this  section  may  be  summarized  as  fol¬ 
lows.  The  graph  may  represent  intermodule  reference  patterns  that  are  trees  or  not  trees.  In 
those  that  are  trees,  ambiguities  never  arise  and  optimality  is  guaranteed.  In  those  that  are  not 
trees,  ambiguous  assignments  may  or  may  not  arise.  If  ambiguities  do  not  arise,  optimality  is 
guaranteed.  In  ambiguous  situations,  an  optimal  assignment  may  or  may  not  result.  (The  algo¬ 
rithm  does  identify  those  ambiguous  situations  for  which  optimality  is  obtained.) 

The  solution  approach  used  in  this  section  is  basically  a  shortest  path  method,  and  it 
should  be  noted  at  this  point  that  the  problem  of  resolving  ambiguities  is  the  primary  motiva¬ 
tion  for  the  use  of  a  backward  shortest  path  method  rather  than  a  forward  method  such  as 
Dijkstra’s  labeling  algorithm  [7,  8,  10].  Forward  and  backward  methods  differ  in  the  auxiliary 
information  they  produce.  The  backward  procedure  described  in  this  section  generates  explicit 
information  about  optimal  paths  from  particular  nodes  to  the  terminal  node  and  this  is  precisely 
the  information  used  in  the  resolution  of  ambiguities. 

The  search  graph  model  which  has  been  described  is  an  acyclic  directed  graph.  If  cycles 
were  allowed  in  the  search  graph,  two  difficulties  could  arise.  One  of  the  problems  could  be 
solved  by  a  modification  to  the  algorithm,  but  the  other  might  cause  the  procedure  not  to  ter¬ 
minate. 

The  first  problem  is  that  a  node  may  be  assigned  a  label  at  one  point,  and  later,  since  it  is 
its  own  predecessor,  receive  a  second,  larger  label.  The  labels  on  the  nodes  in  any  simple  path 
from  the  terminal  node  to  the  initial  node  in  an  acyclic  search  graph  form  a  partially  ordered 
chain,  since  if  node  [A! I  is  an  immediate  predecessor  of  node  [A/],  then  k(Al)  >  A (BJ)  by 
an  amount  equal  to  egj  if  /  -  /,  and  by  an  amount  equal  to  (eu  +  cAB)  if  I  *  J.  This  pro¬ 
perty  is  lost  if  cycles  are  allowed.  A  node  in  s  cycle  may  be  labeled  twice,  and  the  final  label 
will  in  general  be  larger  than  the  labels  on  the  node's  predecessors  in  the  cycle.  The  most 
recent  label  is  the  one  that  should  be  permanently  assigned  to  that  node;  however  care  must  be 
taken  not  to  label  a  chain  of  predecessors  of  a  node  more  than  once  in  the  algorithm. 

The  more  serious  problem  arises  when  a  module  having  multiple  predecessor  stages  is 
given  an  ambiguous  assignment,  during  the  assignment  phase,  and  is  involved  in  a  cycle.  In 
this  case,  in  the  evaluation  of  a  particular  alternative  state  in  the  ambiguous  stage,  the  label  of 
the  state  may  be  dependent  upon  selection  of  a  different  predecessor  state  at  that  same  stage. 
Thus  the  evaluation  process  may  require  indefinite  backtracking  through  the  cycle,  in  an 
attempt  to  resolve  an  unresolvable  ambiguity. 

Cyclic  search  graph  models  are  avoided  simply  by  arranging  the  input  to  the  algorithm  so 
that  the  modules  are  introduced  in  topological  order.  Information  about  the  execution  and 
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communication  costs  of  module  /  is  introduced  before  the  information  about  module  j  if  and 
only  if  /  >  j.  Thus  the  graph  model  is  constructed,  beginning  with  the  terminal  node,  and 
directions  are  imposed  on  the  arcs  in  such  a  way  that  an  arc  is  never  directed  from  a  node  in 
stage  j  to  a  node  in  stage  >  unless  /  >  j. 

4.  BRANCH-AND-BOUND  ALGORITHM 

The  algorithm  described  in  the  preceding  section  generally  yields  low-cost  assignments  of 
modules  to  processors,  but  has  two  characteristics  which  become  serious  disadvantages  when 
the  search  graph  is  large,  that  is,  when  modules  of  a  very  large  distributed  program  are  to  be 
assigned  among  processors  in  a  large  network.  First,  an  evaluation  (in  the  form  of  a  computed 
label)  is  made  of  every  node  in  the  graph,  and,  second,  the  entire  search  graph  must  be  main¬ 
tained  in  memory  to  allow  a  final  assignment  to  be  made.  It  is  therefore  reasonable  to  attempt 
to  modify  the  technique  so  that  it  is  less  costly  in  its  consumption  of  time  and  memory  space, 
yet  still  produces  a  reasonably  low  cost  assignment  of  modules  to  processors.  This  compromise 
represents  an  effort  to  minimize  the  combined  expense  of  the  assignment  and  the  process  used 
to  obtain  the  assignment. 

The  graph  model  to  be  used  as  the  search  graph  is  the  same  one  described  in  the  previous 
section.  A  forward  search  procedure  is  given  that  begins  with  the  initial  node  and  searches 
selected  portions  of  the  graph  to  obtain  a  solution.  The  following  three  considerations  contri¬ 
bute  to  the  efficiency  of  the  technique: 

a.  An  evaluation  function  is  applied  to  the  nodes  in  order  to  estimate  the  "promise’  of  a 
node.  This  criterion  is  used  to  direct  the  search  to  the  nodes  thought  to  be  best. 

b.  Not  all  of  the  successors  of  a  node  are  generated.  Since  it  is  unlikely  that  an  optimal 
assignment  would  place  a  module  on  a  processor  poorly  suited  for  it,  only  those  nodes 
representing  assignments  with  lower  execution  costs  are  considered. 

c.  In  order  to  reduce  the  total  storage  requirements,  final  assignment  decisions  are  made 
within  certain  regions  of  the  graph,  thus  allowing  upper  parts  of  the  search  graph  to  be  dis¬ 
carded  and  making  space  available  for  use  by  deeper  parts  of  the  graph. 

The  heuristic  procedure  used  here  is  a  nonbacktracking  branch-and-bound  procedure  that 
uses  a  local  neighborhood  search  at  each  stage  (21,  25].  An  important  characteristic  of  this  pro¬ 
cedure  is  that  a  feasible  (and  always  improving)  solution  is  available  at  all  times,  in  case  early 
termination  of  the  assignment  algorithm  is  necessitated. 

The  following  definitions  will  be  used  in  describing  the  algorithm. 

it  denotes  any  valid  assignment  graph  (representing  any  feasible  assignment). 

S  is  the  solution  space,  defined  as  {all  n}. 

ir  i  denotes  a  partial  assignment  graph  in  which  modules  1  through  /  have  been  given 

permanent  assignments  and  the  remaining  modules  have  tentative  assignments. 

w,k  is  the  partial  assignment  ir,  in  which  module  /  is  assigned  to  processor  k. 

irm  represents  a  complete  assignment  graph,  where  m  is  the  number  of  modules. 
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Lin,)  denotes  the  cost  of  a  partial  assignment  w,  in  which  communication  costs  are  con¬ 
sidered  for  module  1  through  module  /  but  not  for  modules  i  +  1  through  m. 

/(ir,)  denotes  the  total  cost  of  execution  and  communication  for  a  partial  assignment  ir,. 

f(irm )  denotes  the  total  cost  of  a  complete  assignment  of  all  m  modules,  that  is,  the  total 

cost  of  executing  the  distributed  program  for  the  assignment  nm. 


N(iru)  represents  a  neighborhood  of  ir„  defined  as 

(ir*  I  e,k  <  median  of  ev  for  j  —  1,  ....  «}. 

The  neighborhood  of  a  partial  assignment  is  therefore  simply  that  set  of  partial 
assignments  ir,  obtained  by  varying  the  decision  at  stage  i  among  that  half  of 
the  states  for  which  the  execution  cost  of  the  module  i  is  least. 

Note  that  L  (ir,)  is  a  lower  bound  on  the  cost  function  /  since 
Kir,)  <  /(it,)  for  1  <  /  <  m  —  1 
and 

Kirm)  -  f(nm). 

If  j  is  a  successor  stage  of  i,  then 
L(irj)  >  Kir,) 

therefore  the  lower  bound  increases  as  the  search  proceeds  through  the  graph. 

The  algorithm  utilizes  the  lower  bound  Kir,)  for  /  -  1,  ....  m.  Heuristic  information 
may  be  used  to  establish  this  bound  since  /(ir,)  is  generally  not  known  until  the  search  is  com¬ 
plete.  It  is  desirable  that  a  lower  bound  be  as  close  as  possible  to  the  actual  cost  function 
/(ir,),  but  it  is  also  necessary  that  L  be  simple  to  compute  or  estimate. 

The  bounding  strategy  selected  here  is  the  use  of  a  modified  objective  function  in  which 
the  execution  costs  (the  linear  terms)  are  considered  but  the  communication  costs  (the  qua¬ 
dratic  terms)  are  ignored  until  they  are  easily  available.  Thus,  initially,  the  lower  bound  value 
used  is 


Kiri)  -  £  («nin(e</)) 

/-i  J 

The  original  problem  constraints  are  not  relaxed,  so  the  initial  lower  bound  is  just  the  cost  of 
executing  each  module  on  the  processor  for  which  its  execution  cost  is  least.  As  communica¬ 
tion  costs  are  encountered  at  each  stage  in  the  search  graph,  they  are  included  in  the  lower 
bound  I;  that  is,  terms  are  added  to  the  objective  function  to  yield  a  more  realistic  assessment 
of  the  cost.  Finally,  the  original  complete  objective  function  is  generated  and  L  (irm)  -  f(irm). 

A  formal  description  of  the  algorithm  is  given  in  Appendix  A.  The  notation  appears  at 
first  to  be  slightly  cumbersome,  but  is  justified  since  it  is  a  simplification  of  a  broader  scheme 
(18,  19]  used  to  characterize  a  general  class  of  branch-and-bound  algorithms,  and  therefore 
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allows  the  technique  to  be  defined  in  a  standard  framework.  It  should  be  clear  that  the  algo¬ 
rithm  can  be  stated  quite  simply  in  terms  of  the  search  graph  defined  in  Section  3.  The  search 
begins  at  the  initial  node  s  and  at  each  stage  selects  the  state  indicated  by  taking  the  arc  with 
the  smallest  label.  Ambiguities  are  resolved,  as  in  Section  3,  on  the  basis  of  local  information 
(i.e.,  information  at  the  current  stage  and  immediate  predecessor  stages). 

Several  considerations  pertinent  to  a  practical  implementation  of  the  algorithm  are  men¬ 
tioned  here: 

A  table  of  module  execution  times  is  created  before  the  search  begins.  This  information 
is  used  to  establish  the  initial  solution  which  is  based  solely  on  execution  times.  Module  inter¬ 
communication  costs  are  made  available  to  the  algorithm  as  the  search  progresses. 

A  list  of  "active"  modules  may  be  maintained  that  contains  information  about  modules 
that  have  been  generated  by  the  Branching  Rule  but  not  yet  abandoned  by  the  Deletion  Rule. 

Implementation  of  the  policy  Q  for  selecting  the  partial  assignment  i tj,  such  that 
Liitj,)  <  L(irJk )  for  all  nJk€  N(irj\) 

is  accomplished  simply.  Note  that  the  values  of  all  the  L  (irj)  differ  only  in  the  execution  costs 
eJS,  where  s  -  1' . k'  and  k'  -  n/2’,  and  the  communication  cost  ctj  where  i  is  the  prede¬ 

cessor  stage  of  state  j.  Thus,  the  least  lower  bound  is  selected  by  considering  the  states  jY 
through  jk'  and  choosing  the  one  corresponding  to 

5  “  i' min  *- 1 eJ *  +  c'^' 

Ties  may  be  broken  arbitrarily. 

Figure  S  illustrates  the  result  of  applying  this  technique  to  the  search  graph  shown  in  Fig¬ 
ure  2.  It  happens  in  this  case  that  the  branch-and-bound  algorithm  does  produce  an  assignment 
that  is  optimal. 

5.  COMPUTATIONAL  RESULTS  AND  SUMMARY 

Implementations  of  the  two  algorithms  described  in  Sections  3  and  4  (plus  an  algorithm 
which  performs  an  exhaustive  enumeration  of  all  possible  assignments)  were  developed  for  per¬ 
formance  analysis.  Random  test  data  for  experimentation  were  systematically  generated  for 
hypothetical  networks  of  S,  10,  IS,  and  20  processors  and  5,  10,  IS,  and  20  program  modules. 
Nine  different  networks  were  generated  for  each  problem  size  (m,  n).  Of  interest  are  the  com¬ 
putation  time  requirements  for  each  problem  and  the  cost  (i.e.,  objective  function  value)  of  the 
assignment  produced.  Average  computation  times  for  the  algorithms  are  reported  in  Table  1. 

The  shortest  path  labeling  algorithm  requires  roughly  n2m  additions  and  (n  -  1) nm 
comparisons  (if  no  ambiguities  occur)  and  therefore  should  run  in  0 (n2m)  time.  However,  in 
most  of  the  test  cases,  ambiguities  did  arise  and  computation  time  on  these  sample  networks 
was  0(n2m2). 

The  nonbacktracking  branch-and-bound  search  requires  nm  additions  and  (n  -  l)m  com¬ 
parisons,  and,  as  expected,  the  computation  time  exhibited  on  the  test  cases  was  0(nm). 
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TABLE  2  —  Computation  Times 


/V-  5 

N-  10 

N-  15 

N-  20 

M-  5 

.0156 

.0400 

.0727 

.1220 

.002 

i 

.058 

.008 

.0102 

.0216 

A/  -  10 

.189 

.373 

.654 

.0083 

.0152 

.0304 

.044 

1 

> 

.142 

.443 

.938 

1.61 

.0124 

.030 

.048 

.074 

a: 

i 

K> 

o 

.266 

.835 

1.68 

2.95 

.022 

.042 

.068 

.103 

For  each  problem  size,  the  computation  time  for  the  shortest 
path  algorithm  is  shown  above  the  time  for  the  branch-and- 
bound  algorithm. 

Compulation  times  are  given  in  seconds. 


The  quality  of  the  solutions  produced  by  the  two  algorithms  varies  depending  on  the 
nature  of  the  network.  In  sample  networks  having  execution  costs  and  communication  costs  of 
the  same  order  of  magnitude,  the  shortest  path  algorithm  outperforms  the  branch-and-bound 
method  slightly.  The  difference  in  performance  is  less  pronounced  in  networks  in  which  execu¬ 
tion  costs  dominate  communication  costs.  In  these  cases,  the  cost  of  the  assignment  produced 
by  these  algorithms  never  exceeds  twice  the  cost  of  an  optimal  assignment.  However,  when 
communication  costs  greatly  exceed  execution  costs,  both  algorithms  perform  poorly  (in  some 
cases  yielding  a  cost  as  high  as  S  times  optimal  cost)  and  the  branch-and-bound  algorithm,  of 
course,  is  the  worse  of  the  two.  This  performance  is  not  surprising  in  a  network  dominated  by 
communication  costs.  Recall  that  in  the  objective  function  of  the  problem  formulation  (section 
2),  communication  costs  give  rise  to  the  nonlinear  terms,  and  it  is  precisely  the  nonlinearity  of 
the  objection  function  that  makes  this  particular  scheduling  problem  a  difficult  one  to  solve 
exactly. 

There  have  been  no  published  reports  of  computational  experiments  with  other 
approaches  to  this  particular  scheduling  problem,  so  there  is  no  information  available  on  which 
to  base  estimates  of  relative  efficiency. 

The  scheduling  problem  considered  here  has  an  efficient  solution  for  n  ■»  2  [29].  It  has 
been  shown  (30]  that  the  problem  is  NP- hard  for  n  >  4.  This  property  has  not  been  esta¬ 
blished  for  the  3-processor  case. 

There  are  nm  possible  assignments  to  be  considered  in  this  scheduling  problem  and, 
indeed,  actual  computational  experiments  using  an  enumerative  algorithm  require  time  that  is 
0 (/»"*) .  The  shortest  path  and  branch-and-bound  algorithms  are  of  low  polynomial  complexity 
but  generally  produce  suboptimal  solutions. 

The  problem  of  assigning  modules  of  a  computer  program  among  processors  in  a  distri¬ 
buted  computer  network  has  been  investigated  and  formalized  here  as  a  zero-one  quadratic  pro¬ 
gramming  problem  with  linear  constraints.  This  particular  scheduling  problem  has  received 
relatively  little  attention,  and  several  interesting  topics  remain  for  further  study. 
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a.  Some  effort  should  be  directed  to  studying  the  way  in  which  the  execution  costs  etJ 
(which  represent  affinity  values  for  module  i  to  processor  J)  are  determined.  A  reasonable 
approximation  to  these  costs  may  be  obtained  by  assuming  that  eu  is  some  fixed  constant  times 
eik>  where  the  fixed  constant  is  some  predetermined  value  which  relates  the  overall  speed  and 
processing  power  of  processors  j  and  k.  This,  of  course,  ignores  the  nature  of  module  L  Some 
experiments  might  be  performed  to  show  how  additional  information  may  be  gathered  (for 
example,  from  previous  runs)  to  refine  initial  estimates  of  relative  processor  performance. 

b.  A  similar  analysis  ought  to  be  made  for  the  problem  of  establishing  intermodule  refer¬ 
ence  costs.  Initially  such  costs  might  be  assigned  based  solely  on  the  number  of  intermodule 
references,  assuming  that  each  reference  incurs  the  same  cost.  However,  since  a  control  pro¬ 
gram  must  intervene  to  perform  intermodule  transfers,  information  could  be  gathered  to 
improve  the  original  estimates. 

c.  Special  problems  arise  in  the  determination  of  execution  costs  if  modules  are  allowed 
to  call  themselves  recursively,  and  also  in  the  determination  of  communication  costs  if  a  non- 
deterministic  referencing  environment  is  permitted  (i.e.,  if  address  calculations  are  allowed  dur¬ 
ing  execution).  A  consideration  of  both  of  these  issues  introduces  an  element  of  program  data 
dependency  which  complicates  the  model. 

d.  A  module  should  not,  in  practice,  be  assigned  to  a  processor  that  is  overloaded,  even 
though  that  processor  may  be  the  one  best  suited  for  the  module  [28].  Processor  load  balanc¬ 
ing  might  be  included  in  the  branch-and-bound  procedure  through  a  modification  of  the 
Branching  Rule.  A  neighborhood  could  be  redefined  so  that  it  is  based  not  just  on  execution 
times,  but  rather  on  some  combination  of  execution  costs  and  processor  availability.  A  work¬ 
able  measure  of  current  processor  load  must  be  developed  which  can  be  incorporated  into  the 
existing  model. 

The  search  for  a  solution  to  this  problem  has  included  a  consideration  of  various  tech¬ 
niques  such  as  multi-terminal  cut  trees  [13],  spanning  trees  [20],  and  cluster  analysis  [34],  in 
addition  to  the  approaches  described  in  the  preceding  sections.  Scheduling  problems  generally 
rely  on  a  variety  of  techniques  from  the  areas  of  linear  and  nonlinear  programming,  network 
analysis,  and  graph  theory.  It  appears  that  until  fundamental  theoretical  advances  are  made, 
practical  developments  will  consist  of  heuristic  methods  and  combinations  of  algorithms  from 
diverse  fields. 


APPENDIX  A 

The  nonbacktracking  branch-and-bound  algorithm  can  be  characterized  by  a  seven-tuple 
(/,  Bn,  S0.  L,  P.D.R) 
for  which  parameters  are  defined  as  follows: 

1.  Initial  Solution  Rule.  An  initial  solution  is  obtained  by  assigning  each  module  to  the 
processor  for  which  its  execution  cost  is  least. 

2.  Branching  Rule.  The  branching  process  extends  a  partial  assignment  v,  into  partial 

assignments  trJk  where  j  is  a  successor  stage  to  i  and  k  -  1' . n/2\  and  nodes  j  V  through 
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j  n/T  represent  that  half  of  the  states  in  stage  j  having  the  lowest  exectution  costs.  Thus,  an 
application  of  this  rule  generates  the  neighborhood  N(irn),  where  jV  is  the  initial  state  at 
stage  j.  The  branching  rule  must  be  applied  to  all  successor  stages  of  the  branching  node  in 
stage  L 

3.  Selection  Rule.  The  selection  of  the  next  branching  node  is  based  upon  a  search  within 
the  neighborhood  N(iru)  of  the  state  representing  the  initial  assignment  of  module  i  to  proces¬ 
sor  j.  The  policy  Q  for  making  the  selection  is  to  choose  that  ira  such  that 

L(iru)  <  L(irlk)  for  all  irlk  €  N(wij). 

4.  Lower  Bound  Function.  A  lower  bound  on  the  cost  of  partial  assignment  n,  is  L(n ,), 
as  defined  in  the  list  of  definitions  in  Section  4. 

5.  Permanent  Assignment  Criterion.  If  the  current  stage  i  has  predecessor  stages  which  do 
not  have  a  permanent  assignment,  then  the  assignment  of  the  current  stage  remains  tentative 
even  though  the  selection  rule  has  been  applied  and  an  assignment  decision  made.  At  a  "join” 
point,  a  selection  process  must  be  applied  to  the  stage  for  every  predecessor  stage,  and  ambigui¬ 
ties  resolved  if  necessary.  Only  then  may  the  assignment  at  stage  i  be  considered  permanent 
and  the  assignment  n,  established. 

6.  Deletion  Rule.  A  stage  i  may  be  deleted  from  the  search  graph  when  all  successor 
stages  have  permanent  assignments,  that  is,  when  irk  has  been  established  for  all  successor 
stages  k  of  stage  /. 

7.  Resource  Limitations  Vector.  The  components  of  this  vector  may  be  used  to  control  the 
extent  of  the  search,  by  giving  an  indication  of  the  availability  of  the  resources  required  during 
the  search.  An  upper  bound  on  algorithm  execution  time  may  be  checked  and  the  algorithm 
halted  if  time  is  exceeded.  An  upper  bound  on  available  storage  may  be  used  to  vary  the 
breadth  of  the  search  at  each  stage.  The  size  of  the  neighborhood  N(wi;/)  generated  by  the 
Branching  Rule  may  be  increased  or  decreased  appropriately. 
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ABSTRACT 

Position  finding  has  historically  been  carried  out  by  calculating  the  coordi¬ 
nates  of  the  mean  position  via  a  least-squares  procedure  based  on  the  distance 
of  the  position  from  several  direction  lines.  It  has  been  suggested  that  the 
least-squares  procedure  assigns  too  much  weight  to  outliers  among  the  set  of 
direction  lines,  outliers  which  may  actually  be  associated  with  objects  other  than 
the  one  being  located.  In  this  paper,  a  method  of  using  least-absolute  devia¬ 
tions,  which  yields  a  more  outlier-resistant  median  estimate  of  the  position  in¬ 
stead  of  the  least-squares  mean  estimate,  is  presented. 


1.  INTRODUCTION 

Position  finding  is  the  process  of  Jocating  an  object  on  the  earth's  surface  by  amalgamating 
the  observation  (or  "direction")  lines  to  the  supposed  position  made  by  various  observers  scat¬ 
tered  distances  from  the  true  position.  If  all  observers  could  construct  their  direction  lines 
without  error,  all  lines  would  theoretically  intersect  in  the  true  position.  In  reality,  of  course, 
the  direction  lines  have  a  certain  amount  of  error  associated  with  them,  and  this  circumstance 
necessitates  a  statistical  treatment  of  the  subject. 

The  statistical  foundations  of  position  finding  were  developed  in  1951  by  H.E.  Daniels  [lj. 
Daniels  presented  the  following  assumptions  as  valid  in  most  cases  of  practical  interest: 

(i)  the  earth  is  flat  near  the  true  position; 

(ii)  the  direction  lines  are  straight  lines; 

(iii)  observation  errors  displace  the  direction  line  parallel  to  itself. 

Daniels  represented  the  equations  of  the  direction  lines  in  terms  of  an  angle  of  inclination  tfi 
and  the  perpendicular  distance  p  of  the  line  from  the  origin.  In  line  with  current  practice  (see 
Figure  1)  this  makes  the  equation  of  the  direction  line 

—x  cos  <f>  +  y  sin  <t>  -»  p. 

So,  in  general,  we  have  a  sequence  of  N  direction  lines,  one  equation  -xcos  <t>j  +  y  sin  ** 

Pj  for  each  value  of  j  -  1,  2 . N.  If  we  denote  the  true  location  of  the  object  in  question 

by  (f.  vj),  then  the  perpendicular  distance  from  the  point  (f,  rj)  to  the  j th  direction  line  is 
If  cos  tftj-  Tj  sin  <f>j  +  pj\.  Assuming  that  the  parallel  displacement  error  of  the  Jr  th  direction 
line  is  a  random  variable  with  mean  0  and  variance  trf ,  Daniels  defined  the  "least-squares  fix* 
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to  be  the  point  (TM,  Y*)  that  minimizes  the  sum  of  standardized  squared  perpendicular  dis¬ 
tances 

\ 

fMx,  y)  —  £  a"2 Or  cos  4>j  —  y  sin  <f>j  +  Pj)1. 

J- 1 

We  divide  the  /h  squared  distance  by  <rj  because  we  want  the  least  erroneous  direction  lines  to 
count  most  heavily  in  the  sum.  The  least-squares  fix  has  the  desirable  property  that,  if  all  the 
o-/s  are  equal,  the  vector  sum  of  distances  from  (X*,  Y*)  to  each  of  the  direction  lines  is  0. 
Furthermore,  "normal  equations"  and  explicit  formulas  for  and  Y*  can  be  found  by  the 
usual  partial  differentiation  techniques.  See  Daniels  [1]  for  further  details. 

Recently,  it  has  been  suggested  that  the  least-squares  procedure  assigns  too  much  weight 
to  outliers  among  the  set  of  direction  lines,  outliers  which  may  actually  be  associated  with 
objects  other  than  the  one  being  located.  Such  misassociations  are  common  when  two  or  more 
objects  are  relatively  close  together.  Suppose,  for  example,  that  a  set  of  100  direction  lines 
ostensibly  fixed  on  object  A  actually  contains  20  which  are  fixed  on  object  B.  The  least-squares 
approach  weights  all  100  threesomes  pJt  a-j)  equally  and  does  not  attempt  to  isolate  the  20 
outliers.  Because  the  least-squares  fix  is  essentially  a  mean  of  all  100  direction  lines,  we  should 
not  be  surprised  if  the  fix  were  located  20%  of  the  way  from  object  A  to  object  B  along  the  line 
joining  them.  We  would  then  be  making  a  possibly  serious  error  in  asserting  that  object  A  was 
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located  at  the  'east-squares  fix.  Our  objective  in  this  paper  is  to  present  a  method  of  reducing 
the  effect  of  outfits  by  using  least-absolute  deviations  rather  than  least-squared  deviations. 
This  ODroach  yields  a  median  estimate  of  the  true  position  which  is  more  outlier-resistant  than 
the  traditional  least-squares  mean  estimate. 

To  find  the  “least-absolute-deviations  fix”  ( X *,  T*),  we  have  to  minimize  the  sum  of 
standardized  perpendicular  distances 

N 

£*(x.  y)  *»  £  tr“'  |x  cos  <l>j  -  y  sin  <bj  +  p,|. 
j- 1 

The  fact  that  the  absolute  value  function  is  not  differentiable  in  a  neighborhood  of  its  minimum 
makes  the  usual  approach  impossible.  Partial  differentiation  techniques  alone  cannot  be  used  to 
minimize  £*(x,  y),  and  therefore,  there  are  no  normal  equations  and  no  explicit  formulas  for 
T‘and  Y*.  Where  do  we  begin? 

We  turn  to  the  regression  literature  for  ideas  on  how  to  proceed.  There  is  a  large  number 
of  papers  dealing  with  "least-absolute-deviations  regression.”  Examples  are  those  of  Singleton 
[13],  Karst  [6],  Sharpe  [11],  Forsythe  [31,  and  Schlossmacher  [10].  Absolute-deviation  minimi¬ 
zation  problems  have  been  expressed  and  solved  in  terms  of  linear  or  convex  programming  by 
Wagner  [14],  Davies  [2],  Rao  and  Srinivasan  [9],  and  Narula  and  Wellington  [7],  among  oth¬ 
ers.  Various  competing  methods  are  compared  in  Harvey  [4],  Hill  and  Holland  [5],  and  Narula 
and  Wellington  [8],  Unbiased  estimation  is  discussed  in  Sielken  and  Hartley  [12]. 

In  two  dimensions,  the  least-absolute-deviations  regression  problem  is  to  minimize 

(1)  E'(m,  b)  —  £  |yy  -  mxJ  -  H>| 

j- 1 

while  the  position-finding  problem  is  to  minimize 

£*(x,  y)  -  £  Ixo-,-1  cos  4>j  -  yo-J1  sin  <t>j  +  I 

i~\ 

•V 

[2)  -  £  a-,-1  sin  <t>j\x  cin  4>j  -  y  +  pj  esc  <j>yl 

which  can  be  viewed  as  a  "weighted"  least-absolute-deviations  regression  problem. 

If  the  observation  errors  are  distributed  according  to  the  normal  distribution  (and  there 
are  no  outliers  other  than  those  anticipated  by  this  assumption),  then  Daniels  [1 1  noted  that  the 
least-squares  fix  is  the  maximum  likelihood  estimator  of  the  true  position.  While  the  mqor 
focus  of  this  paper  is  the  problem  of  unanticipated  outliers,  it  is  perhaps  of  some  interest  to 
mention  a  maximum  likelihood  property  of  the  least-absolute-deviations  fix.  Suppose  we  set 

«,  -  x  cos  ib  i  -  y  sin  <t>,  +  Pj 

to  be  the  observation  error  in  the  /h  position  line,  and  we  assume  that  * j  has  the  Laplace  (or 
"double  exponential")  distribution  with  mean  0  and  variance  it}.  Then  «;  has  the  probability 
density  function 

./;(«)  -  — Ur  exp [  — 1« IV5/o-  }. 
cr  ;vz 
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If  there  are  no  outliers  other  than  those  consistent  with  the  Laplace  distribution,  the  max¬ 
imum  likelihood  estimator  of  the  true  position  is  obtained  by  maximizing  the  log-likelihood 
function 

!(«,,  «2,  ...  ,  eN)  -  log(o-;>/2)  -  y/2  £  trJx\tj\. 

J- 1  J- 1 

To  maximize  this  is  to  minimize  the  second  term  (the  only  term  containing  x  and  y).  But  this 
is  equivalent  to  minimizing  Equation  (2).  Therefore,  the  least-absolute-deviations  fix  is  the 
maximum  likelihood  estimate  of  the  true  position  if  the  observation  errors  have  the  Laplace 
distribution. 

2.  THE  MINIMIZATION  PROCESS 

Referring  to  the  expression  E*(x,  y)  in  Equation  (2),  we  define 

(3)  Ej(x,  y)  —  x  eta  <t>j  -  y  +  pj  esc  <f>j. 

Without  loss  of  generality,  we  may  choose  our  origin  of  coordinates  in  such  as  way  that,  for 
each  j  -  1,  2,  ...  ,  N, 

0  <  <t>j  <  180° 

(4)  Pj  >  0. 

If  we  fix  x  and  consider  E'(x,  y)  to  be  a  function  of  y  alone,  then  E'(x,  y)  represents  a  con¬ 
vex  broken  line  in  the  (y,  E*)  plane  with  a  single  vertex  at  the  point 

(5)  yj(x)  -  pj  esc  4>j  +  x  ctn  4>j,  E *  »  0. 

Because  sin  <f»j  ^  0  by  Equation  (4),  the  absolute  error 

(6)  E*(x,  y)  -«  £  o'/Osin  <f>j)\Ej(x,  y)| 

J- 1 

is  a  sum  of  broken  lines  and  is  therefore  a  piecewise-linear  convex  function  of  y,  having  N  ver¬ 
tices,  one  vertex  lying  above  each  of  the  horizontal  axis  points  of  Equation  (S).  See  Figure  2 
for  an  illustration. 

Because  E*(x,  y)  is  a  piecewise-linear  convex  function  in  y  for  fixed  x,  it  has  a  minimum 
which  occurs  at  one  of  the  points  yj(x)  of  Equation  (5).  Its  partial  derivative  with  respect  to  y, 
namely 

a  N 

(7)  cr7'(sin  <£y)  sgn  Ej(x,  y) 

exists  almost  everywhere  (y),  in  fact,  everywhere  except  at  those  values  of  y  in  Equation  (5) 
for  which  sgn  E’ Or,  y)  -  0.  Here 

sgn  9  -  1  if  9  >  0 

-  0  if  9  -  0 

-  -  1  if  9  <  0. 

Consider  a  particular  value  of  x  -  x0,  and  redefine  the  indices  j  “  1,  2,  ....  N  so  that 
the  values  y/x 0)  are  listed  in  increasing  order  from  left  to  right  across  the  graph  of  Figure  2. 
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Figure  2.  Absolute  error  £*as  a  function  of  y 


The  value  of  yj(xQ),  call  it  ^(xq),  at  which  the  minimum  of  E*(x0,  y)  occurs  is  at  the  vertex  in 
Equation  (S)  satisfying  the  equation 

/ox  /  /  xx  ( d£*(x0,  y) 

(8)  sgn  (y  -  y,(x0))  -  sgn - — - 

for  all  y.  For  simplicity  of  notation,  we  set 

(9)  y0  -  jv(x0). 

Then  y/x0)  <  y0  for  j  -  1,2,...,  r-  1,  while  yj(x0)  >  y0  for  j  -  r,  r  +  1,  ...  ,  N.  It  then 
follows  from  Equations  (3)  and  (5)  that 

(10)  sgn  Ej(x0,  y0)  “  sgnQ  -  r) 
so  that  Equation  (6)  yields 

N 

E*(x o,  y0)  -  o-y-l(sin  <^)£}*(x0,  >>o)  sgn  (J  -  r) 

ATq  +  GqXq 
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where 

N 

01)  G0  -  £  <7~'(cos  <l>j)  sgn  O' r) 

j- 1 

£*(x o.  ^o)  is  the  minimum  possible  value  of  £*(x,  y)  when  x  —  x0. 

The  value  of  E*(x,  y)  can  be  reduced  further  by  an  iterative  scheme.  Given  a  value  of 
xk  (beginning  with  Ac  —  0,  perhaps  taking  the  least-squares  x- value  X*  as  x0),  we  obtain  yk  — 
yr(xk )  by  the  process  leading  to  Equation  (9).  The  choice  of  yk  defines  a  value  of  r  —  rk  and 
we  then  calculate  Gk  using  Equation  (11)  with  r  replaced  by  rk. 

If  Gk  >  0,  we  choose  xk+i  <  xk,  and,  if  Gk  <  0,  we  choose  x*+]  >  xk.  The  value  x*+1 
should  then  be  taken  as  small  as  possible  (if  Gk+t  >  0)  or  as  large  as  possible  (if  Gk+t  <  0) 
without  upsetting  the  order  of  the  values  yy(x*+ 1),  and  so  without  changing  G*+i  further. 

The  iteration  ends  when  two  successive  points  (x*,  yk)  and  (x*+),  y*+1)  are  sufficiently 
close  to  each  other,  namely  until  \xk  -  x*+1|  +  \yk  -  yt+1|  is  sufficiently  small. 

Equation  (2)  graphs  into  a  convex  polyhedron  in  the  three-dimensional  space  of  coordi¬ 
nates  (x,  y.  £•)  and  therefore  the  iteration  process  must  converge.  Some  diagrams  of  this  are 
presented  by  Karst  [6].  As  Karst  shows,  the  minimum  value  of  £*(x.  y)  will  occur  either  at  a 
unique  point  (X\  K*),  at  all  points  of  a  unique  line  segment  in  the  (x,  >0 -plane,  or  at  all 
points  of  a  unique  simply-connected  planar  region.  In  all  practical  situations,  the  minimum  will 
occur  at  a  unique  point.  Special  situations,  however,  can  be  contrived  to  result  in  solution  line 
segments  or  planar  regions.  One  example  of  this  will  be  presented  in  the  next  section,  although 
the  likelihood  of  such  special  geometries  occurring  in  practice  is  negligible. 

One  more  comment:  if  there  are  /V  direction  lines,  there  will  be  |^j  —  y  N(N  -  1) 

intersection  points  ("two-line  fixes").  In  all  practical  situations,  the  unique  minimum  point 
(X*.  Y*)  will  be  one  of  the  intersection  points.  The  reason  for  this  is  that,  given  the  result 
(x*,  yk)  of  the  Acth  iteration,  the  method  of  choosing  yk  guarantees  that  (x*,  yk)  lies  on  one  of 
the  lines;  then  we  always  move  xk  (in  the  direction  indicated  by  the  sign  of  Gk)  until  an  inter¬ 
section  is  reached.  The  fact  that  the  solution  (X*.  Y*)  must  be  an  intersection  point  provides 
an  upper  bound  on  the  time  spent  in  the  iteration  process.  In  fact,  a  "brute  force"  approach 
could  be  used  to  find  ( X *,  Y*)  by  calculating  E*(x,  y)  for  each  point  of  intersection  (x,  y)  and 
then  identifying  that  intersection  for  which  £*(x,  y)  is  the  smallest. 

If  N  is  large,  the  brute  force  procedure  may  be  more  time-consuming  than  the  minimiza¬ 
tion  approach.  However,  it  may  possibly  be  more  easily  adapted  to  continual  updating  of  the 
least-absolute-deviations  fix  in  situations  where  the  position  lines  are  reported  sequentially.  A 
large  jump  in  the  location  of  the  least-absolute-deviations  fix  after  a  number  of  sequential  steps 
may  indicate  either  the  presence  of  a  section  object  or  movement  on  the  part  of  the  original 
one. 

3.  AN  EXAMPLE 

In  Figure  3,  we  have  graphed  a  set  of  particularly  exceptional,  but  very  illustrative,  direc¬ 
tion  lines  indicated  by  the  ( 6 ,  p )  ordered  pairs  (45°,  3),  (45°,  6),  (135°,  3),  and  (135°,  6).  All 
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a-j's  equal  1.  Every  point  (x,  y)  inside  and  on  the  boundary  of  the  shaded  square  qualifies  as  a 
least -absolute  deviation  solution  (X*.  Y*)  with  E*(X*.  Y*)  —  6.  The  least-squares  solution  is 
the  point  X11  »  0,  l1*  —  6.36396  at  the  center  of  the  square,  and  PiX*,  1’*)  ■■  9.  Suppose 
we  append  a  fifth  line,  with  (<j>,  p)  —  (40°,  20),  to  the  set  of  direction  lines  in  Figure  3.  Then 
the  weight  produced  by  this  new  line  pulls  the  least-squares  fix  to  the  point  X“  —  -4.06242, 
V**  -  9.77273  outside  of  the  square,  while  the  least-absolute-deviations  fix  X*  —  -2.12132,  Y* 
-  6.36396  remains  at  the  far  left  boundary  of  the  square.  It  would  be  reasonable  to  assume 
that  the  line  (40s,  20)  is  an  outlier  with  respect  to  the  object  we  are  trying  to  locate;  the  least- 
absolute-deviations  fix  is  not  pulled  off  course  by  the  presence  of  the  outlier,  but  the  least- 
squares  fix  is. 
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The  distinction  between  the  two  fixes  becomes  much  more  spectacular  if  the  fifth  direc¬ 
tion  line  is  (0,  p)  —  (30°,  100).  In  this  case  there  is  virtually  no  question  at  all  that  the  line 
must  be  associated  with  a  different  object.  Yet  the  least-squares  fix  is  pulled  all  the  way  to  X * 
-  -27.949,  i*  -  22.5003.  The  least-absolute-deviation  fix,  however,  automatically  eliminates 
the  effect  of  the  outlier  and  remains  at  the  corner  of  the  square  formed  by  the  first  four  lines. 

4.  CONFIDENCE  REGIONS  FOR  THE  LEAST- ABSOLUTE-DEVIATIONS  FIX 

Under  the  assumption  that  the  direction  lines  and  their  associated  parallel  displacement 
errors  are  independent  and  the  errors  have  zero  median  (i.e.,  the  true  position  is  as  likely  to  lie 
on  one  side  of  the  direction  line  as  on  the  other),  Daniels  [1]  presented  a  method  of  construct¬ 
ing  distribution-free  confidence  regions  for  the  true  position.  On  page  192  of  [1]  it  is  shown 
that,  if  there  are  n  direction  lines,  the  probability  is  P„  **  1  —  n 2I_"  that  the  true  position  lies 
within  the  largest  closed  polygon  having  as  sides  segments  of  the  lines  and  as  vertices  pairwise 
intersections  of  the  lines.  Taking  n  —  3,  for  example,  the  probability  is  0.25  that  the  true  posi¬ 
tion  lies  within  the  triangle  formed  by  a  set  of  three  direction  lines.  Taking  n  *»  50,  the  proba¬ 
bility  is  only  8.88  x  10~u  that  the  true  position  lies  outside  the  extremely  large  polygon  formed 
by  the  50  direction  lines.  For  w  *  3,  the  region  is  small  but  the  confidence  level  is  too  low, 
while  for  n  —  50,  the  confidence  level  is  high  but  the  region  is  too  large.  What  we  need  is 
some  compromise  between  these  two  extremes. 

For  purpose  of  discussion,  suppose  we  aim  for  a  95%  confidence  region  containing  the 
true  position.  First  we  can  rank  ail  the  direction  lines  according  to  their  distance  from  the 
median  estimated  position  (Xm,  Y*).  The  largest  closed  polygon  formed  by  the  nearest  nine 
direction  lines  includes  the  true  position  with  probability  P9  —  1  -  (9)(2-8)  —  .964844.  Now 
we  have  to  deal  with  the  remaining  direction  lines.  Making  the  further  assumption  that  the 
parallel  displacement  errors  are  normally  distributed  random  variables  having  mean  0  and  vari¬ 
ance  a-},  the  probability  is  2<t>(<a'y/<r,)  -  1  that  the  jth  direction  line  is  too  far  from  the  point 
( X *,  Y*)  to  be  considered  as  an  estimate  of  the  true  position.  Here  dj  is  the  distance  of  the  Ah 
direction  line  from  ( X *,  Y*),  and  4>(w)  —  (2ir)~l/2  f  ^  exp  (- w2/2)dw.  Therefore,  if  the  n 
independent  direction  lines  are  ranked  from  I  to  n  in  order  of  their  distance  from  (X*.  Y*), 

n 

then  the  probability  is  {2<t>  (d/o-j)  -  1)  that  all  direction  lines  except  the  nearest  nine  are 

not  involved  in  any  way  with  the  true  position.  It  therefore  follows  that  the  largest  polygon 
formed  by  the  k  direction  lines  nearest  to  the  point  (X*,  Y*)  contains  the  true  position  with 
probability 

(1  -  *2'"*)  ft  (2 *(dj/<Tj)  -  1). 

J-k 


As  an  indication  of  how  this  confidence  level  formula  operates,  consider  a  set  of  n  —  100 
direction  lines.  Suppose  the  first  nine  direction  tines  have  normalized  distances  dja  j  less  than 
3.25,  the  10th  to  14th  lines  have  djaj  between  3.25  and  3.75,  the  15th  to  the  30th  have  djvj 
between  3.75  and  4.0,  and  the  remaining  70  lines  have  rf/<ry  in  excess  of  4.0.  In  this  not 
unlikely  scenario,  the  probability  that  the  largest  polygon  formed  by  the  nearest  nine  lines  con¬ 
tains  the  true  position  is  at  least 

(.964844)  (.9988) 5  (.9998)16  (.99994) 70  -  .951997, 
i.e.,  this  polygon  is  a  95.2%  confidence  region  for  the  true  position. 
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Consider  the  example  of  the  20  direction  lines  graphed  in  Figure  4.  The  parameters  of 
these  lines  are  listed  in  Table  1. 
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TABLE  1  —  Parameters  of 
the  20  Direction  Lines 


Line 

Pj 

_Zj_ 

1 

30 

16 

2 

2 

85 

3 

1 

3 

99 

20 

3 

4 

179 

12 

2 

5 

123 

23 

1 

6 

46 

7 

3 

7 

40 

6 

2 

8 

156 

17 

2 

9 

24 

30 

3 

10 

160 

15 

3 

11 

80 

1 

1 

12 

155 

10 

1 

13 

174 

15 

1 

14 

99 

4 

2 

15 

34 

6 

1 

16 

89 

18 

1 

17 

140 

20 

1 

18 

91 

2 

2 

19 

95 

26 

2 

20 

135 

2 

3 

The  least-absolute-deviations  median  position  is  at  X*  —  9.35079,  Y*  —  18.166. 
Arranged  in  order  of  nearness  to  the  median  position,  the  lines  are  listed  again  in  Table  2.  If 
we  form  the  polygonal  region  bounded  by  or  including  the  nine  lines  16,  10,  6,  3,  8,  17,  7,  4, 
and  5,  we  note  from  Table  2  that,  of  the  remaining  11  lines,  2  have  rfy/o-y  in  excess  of  3.54  and 
the  other  9  have  </y/o-,  in  excess  of  3.80.  Because  2«t>(3.54)  —  1  —  .9996  and  20(3.80)  -  1  =* 
.9998,  the  polygonal  confidence  region  illustrated  in  Figure  5  is  at  least  a 

(.9648)  (  9996)2  (,9998)9  -  .9623  -  96.23% 

confidence  region  for  the  true  (median)  position.  By  way  of  comparison,  the  least-squares 
mean  position  is  at  the  point  -  8.68216,  Y*  -  13.0144.  The  standard  error  of  the  estimate 
is  10.8974,  and  the  distance  between  the  mean  and  median  positions  is  5.19479.  The  96.23% 
confidence  ellipse  centered  at  the  mean  position  is  illustrated  in  Figure  4  and  has  semimajor 
axis  of  length  1.2413,  semiminor  axis  of  length  1.0410,  and  angle  of  axis  rotation  of  3.537 
degrees  in  the  counterclockwise  direction. 

Regarding  the  situation  illustrated  in  Figure  4,  a  number  of  interesting  points  may  be 
raised.  Note  that  none  of  the  20  position  lines  passes  through  (or  even  very  near)  the  least- 
squares  ellipse.  The  polygon  containing  the  least-absolute-deviations  fix,  however,  is  located  at 
the  confluence  of  several  position  lines.  In  this  example,  even  the  99.999%  confidence  ellipse 
centered  at  the  least-squares  fix  would  have  no  position  lines  passing  through  it.  (The  semima¬ 
jor  axis  of  the  99.999%  ellipse  is  2.33,  but  the  mean  is  at  distances  2.39  from  line  10,  3.37  from 
line  12,  and  3.67  from  line  6.)  What  seems  to  be  happening  here  is  that  the  ellipse  is  occupy¬ 
ing  a  balancing  position  between  the  confidence  polygon  above  it  and  the  second,  less  concen¬ 
trated,  confluence  of  lines  below  it.  Remember  that  the  least-squares  fix  has  the  property  that 
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TABLE  2  —  Direction  Lines  Arranged  in  Order  of 
Nearness  to  Median  Position 


Line 

dj 

dj/<Tj 

Daniels  Confidence 
Level  of  Region  Formed 
By  this  and  Nearer  Lines 

16 

0.0 

0.0 

0.0 

10 

0.0 

0.0 

0.0 

6 

.428 

.143 

.25 

3 

.595 

.198 

.5 

8 

1.069 

.534 

.6875 

17 

1.160 

1.160 

.8125 

7 

1.486 

.743 

.8906 

4 

2.334 

1.167 

.9375 

5 

2.672 

2.672 

.9648 

15 

3.594 

3.594 

.9805 

13 

3.802 

3.802 

.9893 

12 

6.152 

6.152 

.9941 

19 

7.088 

3.544 

.9968 

2 

14.282 

14.282 

.9983 

1 

15.015 

7.508 

.9991 

11 

15.266 

15.266 

.9995 

14 

15.405 

7.703 

.9997 

18 

16.326 

8.163 

.9999 

20 

17.457 

5.819 

.99993 

9 

31.154 

10.385 

.99996 

the  vector  sum  of  its  distances  from  all  position  lines  is  0.  If  line  9  were  located  very  far  to  the 
left,  therefore,  the  least-squares  ellipse  would  be  drawn  across  the  .p-axis.  The  least-absolute- 
deviations  polygon,  however,  would  remain  unchanged  by  any  movement  of  the  outlying  lines 
away  from  the  center.  In  summary,  then,  the  least-squares  ellipse  is  likely  to  be  located  any¬ 
where  (not  necessarily  in  the  vicinity  of  any  object),  if  there  are  several  outliers  among  the  set 
of  position  lines,  while  theleast-absolute-devialions  polygon  will  span  the  centermost  concentra¬ 
tion  of  position  line  intersections. 
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ABSTRACT 

An  important  class  of  network  Row  problems  is  that  class  for  which  the  ob¬ 
jective  is  to  minimize  the  cost  of  the  most  expensive  unit  of  flow  while  obtain¬ 
ing  a  desired  total  flow  through  the  network.  Two  special  cases  of  this  problem 
have  been  solved,  namely,  the  bottleneck  assignment  problem  and  time¬ 
minimizing  transportation  problem.  This  paper  addresses  the  more  general 
case  which  we  shall  refer  to  as  the  time-minimizing  network  flow  problem.  As¬ 
sociated  with  each  arc  is  an  arc  capacity  (static)  and  a  transferral  time  The  ob¬ 
jective  is  to  find  a  maximal  flow  for  which  the  length  (in  time)  of  the  longest 
path  carrying  flow  is  minimized.  The  character  of  the  problem  is  discussed  and 
a  solution  algorithm  is  presented. 


I.  INTRODUCTION 

A  natural  extension  of  the  bottleneck  assignment  problem  [7]  and  the  time-minimizing 
transportation  problem  [8]  is  the  time-minimizing  network  flow  problem.  The  time-minimizing 
network  flow  problem  is  one  of  finding,  among  all  maximal  flows  in  a  network,  the  one  which 
minimizes  the  longest  time  on  any  path  carrying  flow. 

We  can  develop  a  formal  definition  of  the  time  minimizing  network  flow  problem  as  fol¬ 
lows.  Let  Xj  denote  the  flow  on  path  j.  b,  the  capacity  of  arc  i,  a,  the  time  (length)  of  arc  i, 
and  P  the  arc  -  path  incidence  matrix  (p^  -  1,  if  arc  i  is  in  path  j  and  0  otherwise).  Then  the 
length,  lj,  of  a  path  in  the  network  is  defined  as 

where  P )  is  the  arc  incidence  vector  of  path  j. 

Let  x  —  U,1  be  a  vector  of  path  flows,  fl(x)  the  set  of  basic  paths,  and  let  v^*  be  the 
maximal  flow  in  <7;  then  the  time-minimizing  network  flow  problem  is* 


'Note  that  this  formulation  will  tend  to  drive  a  longest  path  out  of  a  basis  even  if  it  is  in  at  a  zero  level. 
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PROBLEM  I: 

(1)  Minimize  maximum  [a  /*,! 

(x)  Pj  €  B(x) 

(2)  Subject  to  £  xj  >  vmax 

j 

(3)  X  Po  xj  ^  b..  i  -  1 . m 

j 

(4)  x  ^  0. 

As  is  shown  by  the  example  in  Figure  1  and  Table  1,  Problem  I  need  not  have  an  all¬ 
integer  optimum  solution,  he  value  of  the  objective  (1)  for  this  problem  is  L  -  25,  where  L 
is  the  length  of  the  longest  path  carrying  flow.  Since  the  flows  are  fractional,  we  would  not 
expect  that  there  exists  an  efficient  algorithm  (like  that  for  maximal  flows)  for  finding  time 
minimizing  flows  in  G. 


Figure  I  Network  with  fractional  time-minimizing  flow 


TABLE  1  —  ( Unique )  Time  Minimizing 
Flow  Solution  for  Figure  I 
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The  algorithm  proposed  herein  for  solving  the  time  minimizing  network  flow  problem  is 
based  on  the  arc  path  formulation  of  the  maximum  flow  problem.  The  algorithm  proceeds  by 
iteratively  removing  the  flow  from  the  longest  path(s)  in  the  current  basic  feasible  solution,  to 
Problem  I.  The  steps  of  the  algorithm  will  be  developed  along  with  proofs  necessary  to  estab¬ 
lish  its  validity.  Finally,  an  example  is  presented. 

2.  ALGORITHM  DEVELOPMENT 


Suppose  that  we  have  an  optimal  solution  to  the  maximal  flow  problem  in  arc-path  form 
and  have  enumerated  all  paths  in  the  networx  from  the  source  s  to  the  sink  t.  (Employing 
column  generation,  it  is  not  actually  necessary  to  explicitly  enumerate  all  paths.)  Let  L  be  the 
length  of  the  longest  path  carrying  flow  in  this  solution. 

To  determine  whether  there  exists  a  solution  to  the  maximum  flow  problem  with: 

L  L-  1. 

we  solve  the  following  problem: 


PROBLEM  2: 


(5) 

Min  Z  =  £  8,.v( 

(6) 

Subject  to  Px  <  b 

(7) 

•’O  ^  vma* 

J 

(8) 

x  >  0. 

where  8,  is  defined  as  follows  (R  »  0  to  be  specified  later): 


R,  if  j  nonbasic  and  /,  >  L 
1 ,  if  j  basic  and  /,  >  L 
0.  if  Ij  <  1 


If  the  optimal  solution  to  Problem  2  has  Z*  =  0,  then  this  solution  has  a  (min-max  path) 
value  of 


L  L  —  1. 

and  hence  is  a  better  solution  to  the  time-minimizing  flow  problem.  If  Z*  is  positive,  then  the 
previous  solution  was  optimal  and  the  time-minimizing  flow  has  value  L. 

Associating  a  Lagrange  multiplier,  p  >  0,  with  constraint  (6),  we  can  reformulate  Prob¬ 
lem  2  as  follows: 

PROBLEM  3. 

(9)  min  Z  -  £  8,x,  -  P  £  ■*)  “  £  (S,  -  p )  Xj 

i  j  j 

( 10)  Subject  to  Px  <  b 

(11)  x  ^  0. 
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It  should  be  clear  that,  for  p  sufficiently  large,  any  optimal  solution  to  Problem  3  will  have 
£  x,  —  vmail,  and  thus  constraint  (6)  will  be  satisfied.  Hence,  by  redefining  8 y  each  time  L  is 

i 

reduced  we  may  concentrate  on  Problem  3  in  order  to  obtain  the  solution  to  Problem  1. 

Since  constraints  (10)  and  (11)  are  exactly  the  feasible  set  of  the  maximal  flow  problem 
(in  arc-path  form),  we  may  initiate  Problem  3  with  any  maximal  flow.  (We  shall  indicate  how 
to  find  a  maximal  flow  in  arc  path  form  later).  Assume  the  current  basic  feasible  solution  has 
been  ordered  so  that  those  paths  with  l}  -  L  are  first,  those  with  /y  <  L  are  next,  followed  by 
any  slack  variables.  Let  /  —  [j\j  basic,  /,  —  L)  and  J  —  {j \j  basic,  lj  <  L).  Then 
cB  -  ((1  -  p):  -p:  0)  since  8 7  —  1  for  /  €  /  and  8,-0  for  /'  €  J.  Finally,  if  we  denote  the 
dual  vector  for  Problem  3  by  p  then 

(12)  p  =  cB  B~'  =  <(1  -  p):  -p:  0)B~' 

=  (i -p)  I sr'-P 

id  hj 

where  B,'1  is  the  Ah  row  of  B~K  Thus,  p  is  easily  computable  from  the  information  associated 
with  the  current  basic  solution. 


Note  that  there  is  another  way  to  compute  p  depending  on  what  information  i?  available. 
From  (12)  we  can  obtain 


Xflf'-P  £  B’~ 

,il  idUJ 


=  £  Br'  -  p  * 

id 


where  it  is  the  dual  vector  for  the  corresponding  maximal  flow  problem.  Thus,  we  see  that  p  is 
an  adjustment  of  it. 


We  shall  assume  that  pk  $  0,  for  if  such  is  not  the  case,  we  simply  enter  the  slack 
variable  sk  into  the  basis  and  obtain  pk  =  0.  We  shall  also  assume  that  trk  ^  0.  For,  if  not, 


then  zk  -  ck 
sk  into  the  basis. 


Pk 


I*. 

id 


-i 


-  pirk  >  0  since  p  »  0;  and  thus  we  may  enter  stack  variable 


To  identify  a  candidate  vector  to  enter  the  basis  for  Problem  3,  we  make  the  folio.jing 
observations.  First,  no  (nonbasic)  path  with  /,  >  L  is  a  candidate  since  8,  —  R  »  0  will  pro¬ 
duce  r,  ■  Cj  -  p  Pj  -  (R  -  p)  ^  0.  In  fact,  since  p  Pj  <  0  we  now  see  that  it  suffices  to 
choose  R  -  p.  Thus,  we  may  focus  on  (simple)  paths  Pk  which  satisfy  p  Pj  -  ( -p )  >  0  (i.e. 
(— o)  Pj  <  p)  and  aPj  <  L  —  1. 


If  one  exists,  we  can  locate  a  candidate  path  satisfying  the  aforementioned  conditions  by 
solving 

minimize  ( — p)  Ps 
Subject  to  aPj  <  L  -  1 
j  a  path. 

This  is  a  constrained  shortest  path  problem,  for  which  a  number  of  solution  algorithms  [1,  10, 
12]  exist. 
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3.  ALGORITHMIC  DETAILS 

Figure  2  presents  a  flow  chart  of  the  algorithm  developed  in  the  previous  section.  Opera¬ 
tions  0 and® are  simplex  pivots.  Operations(2)and(2>are  simple  computations.  Operation  5  is 
somewhat  more  involved,  but  as  indicated  previously  several  algorithms  do  exist  for  the  solu¬ 
tion  of  this  subproblem. 

Operation  0  appears  to  be  quite  involved.  However,  a  very  simple  method  exists  for 
developing  an  initial  basis  and  its  inverse.  Employ  any  labeling  method  (e.g.,  see  [3])  to  find  a 
maximal  (node-arc)  flow.  Establish  a  path  decomposition  of  this  flow  as  follows. 

0.  Set  k  -  1 


1.  Label  along  arcs  with  positive  flow  to  obtain  a  path,  k ,  from  source  to  sink. 


Put  as  much  flow  on  path  k  as  possible,  thereby  reducing  some  arc  flow  (on  the 
path)  to  zero.  Reorder  the  arcs  so  that  this  becomes  arc  k.  Establish  Pk. 

Set  k  —  k  +  1  and  repeat  steps  1  and  2  i  all  flow  has  been  decomposed.  Go 
to  step  4  with  the  last  value  k  -  q. 

Fill  out  the  remaining  basis  members  with  slack  variables  for  each  of  the  remain¬ 
ing  arcs.  Then 


B  « 


/>,.  P2.  ...  Pa 


0 


Note  that  B  is  upper  triangular  since  Pk  has  zeros  in  elements  1,2 . k  -  1  and  a  1 

in  element  k.  Thus, 

B  '  -/-<£-/). 


4.  EXAMPLE 

We  shall  use  the  network  of  Figure  3  to  illustrate  the  calculations  for  solving  the  con¬ 
strained  shortest  path  problem.  The  circled  number  adjacent  to  each  arc  is  the  flow  on  that  arc 
with  respect  to  maximum  flow  on  the  network.  All  arc  capacities  are  one  and  the  arc  lengths 
are  Riven  on  each  arc. 

The  maximal  flow  (minimal  cost)  solution  is  given  by  one  unit  of  flow  on 
P\  -  {l,  4,  8,  10|  and  one  unit  of  flow  on  P2  -  {2,  6,  11}.  Wc  consider  Pt  and  P2  to  be  basic 
with  respect  to  arcs  one  and  two,  respectively.  The  remainder  of  the  basis  consists  of  the 
appropriate  slack  variables. 
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rs 


Figure  2.  Flow  chart  or  the  time-minimizing  network  flow  algorithm 
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For  this  maximal  flow  solution: 


and 


B  - 


10000000000 

01000000000 

00100000000 

10010000000 

00001000000 

01000100000 

00000010000 

10000003000 

00000000100 

10000000010 

01000000001 


B~'  - 


1  0000000000 
0  1000000000 
0  0100000000 
-1  0010000000 
0  0001000000 
0  -1000100000 
0  0000010000 
-l  0000001000 
0  0000000100 
-1  ooooooooio 

0  -I  000000001 


After  two  iteractions  we  still  have  x,  -  l,  and  x2  —  l,  but  we  also  have 
P3  -  {l,  3,  7,  10}  and  P4  -  |2,  5,  8,  10}  in  the  basis  (for  slacks  8  and  10,  respectively)  with 

vj  -  x4  -  0. 
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At  this  point  we  have: 

L  -  l2-  12, 

I  -  1-11, 


and 


B~l  - 


Also,  we  have 


1 

000000 

10-10 

1 

100000 

0  0-10 

0 

0  1  0  0  0  0 

10-10 

-1 

001000 

-10-10 

1 

000100 

0  0-10 

-1 

-1  0  0  0  1  0 

0  0-10 

0 

000001 

10-10 

-1 

000000 

0  0  10 

0 

000000 

0  1  0  0 

0 

000000 

-10  10 

-1 

-1  0  0  0  0  0 

0  0  11 

it  -  (1.  1.  0,  0,  0,  0,  0.  0,  0,  0,  0). 

Since  the  second  basic  path  is  the  longest,  we  have 

B{'  -  (1,  1,  0,  0,  0,  0,  0,  0,  0,  -1,  0). 

If  we  let  p  -  4,  we  have 

pn  -  B2'  -  (3,  3,  0,  0,  0,  0,  0,  0,  0,  1,  0). 

Thus,  we  wish  to 

minimize  (3,  3,  0,  0,  0,  0,  0,  0,  1,  0)  Pj 
Subject  to:  (1,  5,  3,  2,  1,  4,  3,  1,  1,  2,  3)  Pj  ^  11 
Pj  is  a  path  from  s  to  t. 

The  optimal  solution  to  this  constrained  shortest  path  problem  is 
P5  -  {2,  5,  8,  9,  11}. 

If  we  bring  this  path  into  the  basis  (replacing  P2)  we  obtain  a  solution: 
x5  -  x3  -  1 

*1  -  *4  -  0. 

The  remaining  basic  variables  are  slack  variables. 

In  Figure  4  the  arc  flows  corresponding  to  the  current  solution  are  shown  by  circled 
numbers.  For  this  solution 

L  -  /5-  11 

L-  1-10, 
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0 

-1 

0  0  0  0  0 

1 

0 

0  0 

1 

1 

0  0  0  0  0 

0 

0 

-I  0 

-1 

-1 

10  0  0  0 

1 

0 

0  0 

0 

I 

0  1000 

-1 

0 

0  0 

0 

-1 

0  0  10  0 

0  0 

0  0 

0 

0 

0  0  0  10 

0 

0 

0  0 

-1 

-1 

0  0  0  0  1 

1 

0 

0  0 

-1 

0 

0  0  0  0  0 

0 

0 

1  0 

-1 

-1 

0  0  0  0  0 

0 

1 

1  0 

1 

1 

0  0  0  0  0 

1 

0 

0  0 

-1 

-1 

0  0  0  0  0 

0 

0 

1  1 

The  simplex  multipliers  are 

7 r  -  (I.  1,  0.  0,  0,  0,  0,  0,  0,  0,  0). 

If  we  let  p  —  4,  we  have 

d-  4rr-  Bfl  -  (3.  3,  0.  0,  0,  0,  0,  0,  0,  1,  0). 

The  optimal  solution  to  the  corresponding  constrained  shortest  path  problem  is 
P6  -  {1,  4,  8,  9,  11).  After  pivoting  we  have 

*5  “  *3  “  1 
*6  “  *4  "  0. 

The  longest  path  in  the  basis  remains  P%.  We  have 
L  -  1  -  10 

flf1  -  (1,  2,  0.  0,  0,  0,  0,  -1.  0,  -1.  0) 

it  -  (1,  1.  0,  0,  0,  0,  0,  0,  0,  0,  0) 

4ir  -  fi2-‘  -  (3,  2,  0,  0,  0,  0,  0,  1,  0.  1,  0). 
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With  these  arc  costs,  the  optimal  constrained  shortest  path  is  Pi  with  reduced  cost  equal  to  0. 

Thus,  we  conclude  that  the  current  solution  to  the  problem  is  optimal. 
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ABSTRACT 

Transportation  problems  with  uncertain  demands  are  useful  applied  models 
themselves,  and  also  they  represent  in  a  formal  way  the  problem  of  estimating 
demands  for  use  in  deterministic  models.  We  consider  the  effects  of  using  a 
small,  aggregate  model  of  this  type  in  place  of  a  larger,  more  detailed  one. 
Formulation  of  the  aggregate  objective  function  turns  out  to  depend  on  how 
one  chooses  to  use  ('disaggregate*)  the  solution;  several  alternative  methods 
are  examined.  Bounds  are  derived  on  the  error  induced  by  the  approximation, 
thus  facilitating  comparison  of  alternative  aggregations.  We  also  consider  the 
problem  of  estimating  demands  for  an  aggregate-level  deterministic  problem. 
In  a  specific  sense,  it  is  often  not  the  case  (as  one  might  expect)  that  such  ag¬ 
gregate  demands  are  easier  to  estimate  than  the  detailed  demands.  This  is  be¬ 
cause  aggregation  and  centralization  are  not  the  same  thing. 


INTRODUCTION 

This  is  a  study  of  transportation  problems  whose  destinations  represent  aggregates  of 
smaller,  ’original"  destinations,  and  where  the  demands  at  the  destinations  are  not  known  with 
certainty. 

The  models  analyzed  represent  uncertainty  by  means  of  an  additional  cost  incurred  when 
total  shipments  to  a  destination  exceed  or  are  exceeded  by  the  true  demand.  Such  additional 
costs  may  represent  emergency  procurement,  lost  sales,  disposal,  salvage,  and  the  like.  Thus, 
the  models  are  stochastic  linear  programs  with  simple  recourse,  representable  as  ordinary  non¬ 
linear  programs.  These  have  been  studied  extensively  by  Williams  [10,  11,  121,  among  others. 
We  shall  refer  to  such  a  ’transportation  problem  with  uncertain  demands”  as  a  TPUD. 

This  paper  has  two  purposes,  reflecting  two  distinct  aims  of  prior  studies  of  the  TPUD. 
First,  a  TPUD  can  itself  be  used  in  practice,  like  other  models,  to  help  solve  a  given  decision 
problem  (e.g.,  [4]).  From  this  viewpoint  our  analysis  follows  the  same  lines  as  earlier  work  on 
aggregation  of  deterministic  network-type  problems  (Kuhn  and  Cullen  [8],  Geoffrion  [S,  6], 
Evans  (2,  3],  and  Zipkin  [13,  16]): 

Suppose  a  large,  original  TPUD  is  given  (Section  2),  and  we  approximate  it  by  aggregating 
groups  of  destinations.  Upon  solving  the  aggregate  problem,  suppose  we  were  to  disaggregate 
its  solution,  obtaining  a  feasible  solution  to  the  original  problem.  The  cost  of  such  a  disaggre¬ 
gated  solution  would  provide  a  natural  specification  of  the  objective  function  of  the  aggregate 


VOL.  29,  NO.  2,  JUNE  1982 


257 


NAVAL  RESEARCH  LOGISTICS  QUARTERLY 


258 


P.  ZIPKIN 


problem,  as  well  as  a  natural  criterion  for  the  error  resulting  from  substituting  the  aggregate  for 
the  original  problem.  In  Section  3  we  consider  two  distinct  approaches  to  disaggregation, 
"fixed-weight”  and  "optimal”  disaggregation.  In  every  case,  the  aggregate  problem  is  itself  a 
TPUD,  and  under  certain  assumptions  ((Al)  and,  for  fixed-weight  disaggregation,  (A2)),  the 
demand  distributions  in  this  problem  are  of  reasonably  simple  form.  Under  optimal  disaggrega¬ 
tion  the  aggregate  objective  function  is  derived  by  an  approximation  method  which  has  not 
been  used  before  in  this  context. 

Section  5  presents  bounds  on  this  measure  of  error,  which  do  not  require  solution  of  the 
original  TPUD.  These  may  be  used,  therefore,  to  help  choose  among  alternative  aggregations 
and  to  evaluate  the  solution  for  a  particular  aggregation.  Analysis  of  the  form  of  the  bounds 
also  suggests  qualitatively  which  factors  are  important  in  the  design  of  good  aggregations. 

The  second  reason  for  studying  the  TPUD  is  that  it  represents  a  formalization  of  the 
problem  of  estimating  demands  for  use  in  a  classical  transportation  problem  (CTP).  The  objec¬ 
tive  function  of  the  TPUD  is  now  interpreted  as  the  total  expected  loss  (in  the  Bayesian  sense) 
as  a  function  of  the  estimates,  where  the  estimates  correspond  to  total  shipments  to  the  respec¬ 
tive  destinations.  (This  interpretation  is  suggested,  for  example,  in  [12]  for  general  stochastic 
linear  programs.) 

When,  as  is  often  the  case,  the  destinations  of  the  CTP  represent  aggregates,  it  is  natural 
to  use  the  true  distribution  of  total  demand  as  the  basis  for  (Bayesian)  estimation  of  demand  at 
each  destination.  The  discussion  in  Section  4,  based  on  examination  of  the  aggregate  objective 
function,  suggests  that  this  approach  is  incorrect,  unless  aggregation  of  the  model  is  accompanied 
by  actual  centralization  in  the  system  being  modeled.  The  correct  distributions  have  larger  vari¬ 
ances  than  those  of  the  true  aggregate  demands. 

Similarly,  if  demands  are  estimated  by  the  usual  classical  techniques,  we  argue  that  aggre¬ 
gation  per  se  may  not  reduce  overall  data  requirements. 

These  peculiar  results  are  explained  by  the  difference  between  aggregation  of  a  model  and 
actual  centralization  of  the  distribution  system  represented  by  the  model.  They  demonstrate, 
we  believe,  that  the  usual,  intuitive  view  of  the  meaning  of  aggregation  may  in  some  cases  be 
misleading. 

2.  THE  ORIGINAL  PROBLEM 

We  state  the  original,  detailed  TPUD  as  follows: 

**  “  min  J^CyXy  +  £a,0-/) 
u  J 

(1)  subject  to  <u  <  sif  i  €  / 

j 

£*</  -  yj  -  o,  j  e  / 

/ 

xu  >  0,  I  €  /,  j  €  J, 

where 

ojiyj)  -  yj  J~  (f  -  yj)dFj(()  -  8,  /_%  -  t)dFt( (). 
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The  notation  (mostly  following  Williams  {11])  is  defined  as  follows: 

1  -  index  set  for  sources 

J  -  index  set  for  destinations 

Xy  -  shipment  from  source  i  to  destination  j 
yy  -  total  shipment  to  destination  j 

c,j  -  unit  shipment  cost  from  /  to  j 

s ,  —  supply  at  source  / 

Fj  —  cumulative  distribution  of  demand  at  destination  j 
yj  -  unit  cost  of  undersupply  at  j 

8 j  -  unit  salvage  value  (or  minus  oversupply  cost)  at  j 

z*  -  minimal  total  cost. 

The  summations  are  defined  over  appropriate  subsets. 

The  following  assumptions  are  made  to  avoid  trivialities  and  to  simplify  the  notation: 

a.  y j  >  min^Cyl  ^  6,  and  y;  >  S;,  j  €  / 

b.  Sj  >  0,  i  €  /. 

c.  Each  Fj  has  finite,  positive  mean  py.  Also,  each  Fy  is  sufficiently  continuous  that 
aj  can  be  written  in  the  simpler  form, 

(2)  a,(yj)  -  yjtij  -  yJy)  +  (y,  -  8,)  F,(£)rff. 


These  assumptions  are  sufficient  to  ensure  that  (1)  is  a  convex  program  with  a  finite  optimum. 
Let  x*  -  (x*)  and  y*  -  O'*)  denote  such  an  optimum.  Also,  let  n*  «  (nf)  and  p*  —  (pj) 
denote  an  optimal  dual  solution  to  (1). 

The  optimality  conditions  for  problem  (1)  are  as  follows: 

it*  <  0  and  7t*(£x*  -  st)  -  0 ,  i  €  /; 

j 

it  *  +  p  *  <  ctJ  and  x*j (ir  *  4-  p  *  -  c0)  —  0.  i  €  /,  j  €  /; 

(x*.  y*)  feasible  for  (1); 

inf  {>,:  yy  >  0,  Fy  <yy)  >  (y,  -  pj)/(yy  -  8,)},  j  €  J. 

To  simplify  the  latter  condition  we  may  at  times  invoke  the  following  assumption: 

d.  Each  Fj  is  continuous  and  increasing  over  its  positive  range.  In  this  case,  the  last  con¬ 
dition  reduces  to  the  following: 

(3)  Fj(y*)  -  ( yj  -  p,*)/(y,  -8 y),  j  Z  J. 

Viewing  (l)  as  a  problem  of  Bayesian  estimation  of  demands,  the  condition  (3)  suggests  a 
heuristic  approach  to  determining  "certainty  equivalents"  y'  [12]:  First  try  to  estimate  each  mar¬ 
ginal  shipment  cost  py.  Then,  use  the  results  with  (3)  to  estimate  the  y*.  The  role  played  by 
the  Fj  (interpreted  as  summarizing  beliefs  concerning  "true”  demands)  in  this  "metamodel*  of 
the  estimation  process  will  be  significant  in  the  discussion  in  Section  4  below. 
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3.  AGGREGATION  AND  DISAGGREGATION 

Aggregation :  By  aggregation  of  destinations  we  mean  the  following:  Partition  the  destina¬ 
tions  J  into  subsets  Tk,  k  6  K,  and  replace  each  subset  Tk  by  the  single  "aggregate"  destination 
k.  No  restrictions  are  placed  on  the  partition.  (If  the  original  problem  includes  only  a  subset  of 
the  arcs  I  x  J,  then  each  j  €  Tk  must  be  linked  to  the  same  subset  of  sources,  k  €  K.) 
Denote  the  aggregate  problem  as  follows  (the  numbers  cfk  and  the  functions  a4  are  thus  far 
undefined): 

r°  -  min  £  c4kx4k  +  £  afiyf) 

ik  k 

(4)  subject  to  <  s,  i  €  / 

k 

-  0,  k  €  K 

i 

xi  >  0,  /  €  /,  k  €  K. 


Let  AO)  denote  that  k  for  which  j  €  Tk,  j  6  J.  The  superscript  A  will  be  used  to  distin¬ 
guish  aggregate-level  variables  and  data.  Otherwise,  the  notation  is  analogous  to  that  of  prob¬ 
lem  (1).  Let  (x40,  y40)  denote  an  optimal  solution  to  (4)  (assuming  one  exists),  and 
(it40,  p40),  a  dual-optimal  solution. 

Aggregation  with  Fixed-Weight  Disaggregation:  Consider  the  following  method  of  defining 
the  objective  function  of  (4):  Choose  any  positive  weights  wjt  j  €  J  with  £  Wj  -  1,  and  set 

cfk  “  £  "jC.j.  i  €  A  k  €  K 
h 

fl^W)  -  £  V'V’*4*-  k  €  K 
h 

(Viewing  (1)  as  a  convex-cost  network-flow  problem,  this  objective  function  is  a  special  case  of 
the  one  proposed  in  (16].) 

This  method  of  aggregation  is  closely  related  to  fixed-weight  disaggregation:  Given  an 
optimal  solution  (x,^°.  yf°)  to  problem  (4)  and  weights  w  -  (*v;),  define  a  solution  (x°,  y°)  to 
problem  (1)  as  follows: 

-  *, *,*?,)•  i  €  /,  j  €  J. 

y,°  -  ‘VhV  j  €  j. 

It  can  be  shown  [16]  that  (x°.  y°)  is  feasible  for  (I)  and  that 

£c„<  +  £a,(y,°)  -  Jc4kx4k°  +  “  7°. 

ij  j  ik  k 

We  may  interpret  these  facts  as  follows:  The  ultimate  result  of  solving  (4)  is  the  feasible  (but 
perhaps  suboptimal)  solution  (x°,  y°).  Assuming  the  given  weights  w  are  to  be  used  for 
disaggregation,  solution  of  (4)  yields  that  solution  whose  disaggregation  is  of  minimal  possible 
cost  in  problem  (1). 

Evidently,  z°  ^  z*.  In  Section  5  we  shall  derive  bounds  on  z°  -  z*. 
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The  definition  of  the  ak  above  ensures  that  (4)  is  a  convex  program.  It  is  perhaps  less 
obvious  that  (4)  is  a  TPUD,  that  is,  that  ak  is  of  form  (2)  (except  perhaps  for  a  different  con¬ 
stant  term).  For  each  k  €  K,  define 

■y*  -  TwjVj‘  s*  “  X>Ar 

r*  rk 

We  have 

ak(yf)  *  Jj jUj  -  Jjj%yk  +  ~  s,)  f  J  *  Fjtydt- 

h  h  Tk 

The  first  term  is  a  constant,  and  the  second  term  is  simply  —yj?  yk.  Changing  variables,  and 
defining  v,  —  w,(y,  -8 ,)/(y*  —  8*0,  the  third  term  becomes 

I  -  V  ft  *  S&M  ~ 

h  h 

-  (y*  ~  8*)  J_'f  £v,F,(w,*)^ 

Tk 

-  (y*  -  )  J_'f  F£{f)dt. 

where  F(  (£)  -  £v,F,(w,£),  for  all  £.  Since  Tv,  -  1.  F*'  is  a  cumulative  distribution  func- 
tion,  so  a£  is  of  form  (2). 

F, *  satisfies  assumption  c  in  Section  2.  If  the  F,  satisfy  assumption  d,  so  does  Fk.  It  is 
easy  to  shown  that  min,{c^}  ^  Sj*  and  y(  >  8* ,  but  it  may  not  be  true  that  y(  ^  min,(c,*|. 
We  simply  assume  this  to  be  true;  if  not,  no  shipment  would  be  made  to  k. 

The  functions  Fk  are  point  mixtures  of  rescaled  versions  of  the  original  demand  distribu¬ 
tions  Fj.  j  €  Tk.  In  general,  distributions  of  this  form  are  not  easy  to  work  with.  For  exam¬ 
ple,  the  original  Fj  may  be  easy  to  invert,  as  required  by  the  optimality  conditions  (3),  but  this 
properly  may  not  be  inherited  by  the  F(.  Unless  the  Ff  do  have  simple  forms,  one  would 
probably  wish  to  approximate  them  in  practice. 

It  turns  out,  however,  that  under  certain  conditions  the  Fk  are  tractable.  Specifically,  sup¬ 
pose  the  following  two  assumptions  hold: 

(Al)  For  each  k  6  K  there  exists  a  probability  distribution  Hk  with  mean  1  such  that 
F,(()  -  Hk(t/p,)forallt  and  j  6  Tk. 

( A2)  Let  p. k  —  2 >i  and  m,  —  fijpk.  j  6  Tk,  k  €  K.  Set  the  weights  Wj  —  m,,  j  €  J. 
h 

The  first  assumption  states  that  the  demand  distributions  within  each  subset  are  identical, 
except  for  a  scale  factor.  Several  well-known  families  of  distributions  satisfy  this  condition,  for 
example: 
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(i)  Each  Fj  is  gamma  (or  each  is  Weibull,  or  each  is  Pareto),  with  a  common  shape 
parameter; 

(ii)  Each  Fj  is  normal,  with  a  common  coefficient  of  variation. 

(Al)  thus  imposes  a  not  entirely  unreasonable  model  of  demands  of  different  sizes  for  the 
same  item.  Assumption  (A2)  requires  chosing  weights  proportional  to  the  mean  demands, 
which  is  also  reasonable. 

Under  these  conditions  we  have,  for  any  f , 

L 

-  £v,//*(*w)  -  /#*(«w>. 

tk 

Thus,  Fk  is  of  the  same  family  of  distributions  as  the  Fjy  and  is  just  as  tractable.  The  constant 
term  in  a{  here  is  Tyy/t;  —  yk  (if. 


The  mean  of  F*  is  just  p.k ,  the  sum  of  the  individual  means.  Also,  let  crk  denote  the 
variance  of  Hk.  Then,  the  variance  of  Fj  is  (p.j<rk)2,  j  6  Tk,  and  the  variance  of  Fk  is 
(* ik<rk )2.  Thus,  Ff  would  be  the  distribution  of  the  sum  of  the  demands  at  destinations 
j  €  Tk,  if  those  demands  were  perfectly  correlated.  We  shall  return  to  this  observation  in  Sec¬ 
tion  4. 

Aggregation  with  Optimal  Disaggregation,  First  Form:  "Optimal"  disaggregation  has  been 
defined  (e.g.,  (13])  as  any  method  of  obtaining  a  solution  to  (1)  from  a  solution  to  (4)  which 
improves  on  the  solution  derived  from  fixed-weight  disaggregation,  usually  by  solving  by  solv¬ 
ing  one  or  more  relatively  simple  optimization  subproblems.  The  special  form  of  the  TPUD 
suggests  several  natural  forms  of  optimal  disaggregation. 

The  first  form  we  shall  consider  is  as  follows:  Define  y°  as  in  fixed-weight  disaggregation, 
but  obtain  x°  as  the  solution  to  the  CTP  with  demands  y°  and  costs  and  supplies  as  given  in 
(1).  Since  the  value  of  x  in  fixed-weight  disaggregation  is  feasible  for  this  CTP,  the  resulting 
solution  is  obviously  of  no  greater  (and  usually  of  lesser)  cost. 

It  would  be  useful  to  be  able  to  define  the  objective  function  of  the  aggregate  problem  to 
reflect  the  fact  that  optimal  disaggregation  would  be  used  to  recover  a  solution  to  (1).  For  this 
form  of  optimal  disaggregation,  unfortunately,  there  appears  to  be  no  such  definition  which  is 
clearly  better  than  the  one  above.  Solution  of  that  aggregate  problem,  therefore,  minimizes  an 
upper  bound  on  the  cost  of  the  solution  ultimately  obtained. 

In  Section  5  we  shall  mention  a  bound  on  the  suboptimality  of  this  (x°,  y0),  due  to  Willi¬ 
ams  [12], 

Aggregation  with  Optimal  Disaggregation,  Second  Form:  The  other  form  of  optimal  disaggre¬ 
gation  we  shall  consider  focuses  on  improvement  of  the  shortage/disposal  costs,  as  opposed  to 
the  distribution  costs.  First  we  describe  the  method,  and  then  justify  it. 
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If  yltlj)  *  0-  set  y j°  =  0,  and  x®  =*  0,  /  6  /,  as  in  fixed-weight  disaggregation.  Otherwise, 
define 

r,  -  <£  c,jXtl  )/yk%. 

i 

and,  for  each  k ,  solve  the  following  subproblem,  with  yk  —  v^°: 

+  minz[-(y>  ~  r Jy; +  ^vj  ~  8/)  J*  1 

r*  h 

(5)  subject  to  £  yt  - 

h 

yj  >0 ,j€  Tk. 

Let  yf  denote  the  optimal  y,  in  problem  (5)  for  k(J).  Then,  set 

(6)  x„°  -  ( yf/yffj )  )x*?,i.  i  €  /  •  j  €  J. 


First  of  all,  the  resulting  solution  (x°,  v°)  is  feasible  lor  (1),  since  x,®  >  0  for  all  i,  j,  and 

I<-  s„  i  €  /, 

>  * 

(.v//<,)£x^,  -  (tf/jCWfr  -  y°,  j  €  y. 

f  i 

Moreover,  the  distribution  cost  of  (x°,  y°)  is 

'Lc‘ixS-  "Zc'j  iy?/y$>  )x*l> 


ISvx&IMb 


y> 


-  Ir^°- 

j 

Thus,  the  objective  function  in  (5)  includes  the  distribution  cost  of  the  x°  ultimately  obtained. 
Finally,  the  y  from  fixed-weight  disaggregation  is  feasible  for  all  the  problems  (5),  and  if  it  is 
used  in  (6)  to  define  x,  the  result  is  just  the  x  of  fixed-weight  disaggregation.  Thus,  (x°,  y°)  is 
no  more  (and  usually  less)  costly  than  the  solution  derived  by  fixed-weight  disaggregation. 

Zipkin  [15]  (following  earlier  work  in  the  references  therein)  has  developed  quite  efficient 
methods  to  solve  problems  of  form  (5). 

For  this  form  of  optimal  disaggregation  it  is  possible  to  specify  the  objective  function  of 
(4)  so  as  more  nearly  to  reflect  the  true  cost  of  the  disaggregated  solution,  namely  £<?*iy*°). 

k 

This  requires  assuming  (Al)  and  d.  Zipkin  [17]  shows  that  a  function  of  the  form  of  qk  can  be 
well-approximated  by  an  explicit  function  of  the  following  form: 


(7) 


*W>  =  I  y jUj  -  (y*  -  rflyf  +  {y(  -  hi)  /  f  FiiOdt. 


VOL  29,  NO.  2.  JUNE  1982 


NAVAL  RESEARCH  LOGISTICS  QUARTERLY 


264 


P.  ZIPK1N 


Here,  the  terms  (yk  -  r* )  and  (y£  -  8**)  are  constants.  Several  methods  for  determining 
these  constants  are  given  in  [17],  each  with  its  particular  advantages.  (They  are  written  as 
differences  to  facilitate  the  discussion  below,  where  actual  values  of  y k  and  8*  are  derived. 
Note  that  these  are  not  the  same  values  as  those  defined  for  fixed-weight  disaggregation.) 

To  use  the  approximation  (7)  to  construct  an  objective  function  for  the  aggregate  prob¬ 
lem,  one  must  specify  several  quantities.  First,  since  the  approximation  must  be  made  before 
solving  (4),  the  T,  must  be  estimated.  That  is,  for  each  k ,  one  must  estimate  the  proportion  of 
y£°  which  comes  from  each  source  /,  and  use  these  estimates  in  place  of  the  true  proportions  in 
the  definition  of  T,  above.  Second,  qk  includes  distribution  costs,  ami  these  must  be  deleted 
from  (7)  in  order  to  define  ak.  For  this  purpose  estimate  weights  w,  to  approximate  yflyk °. 
(For  example,  given  ry,  drop  the  constraints  of  (5)  and  solve  for  each  j  separately.  Then 
choose  Wj  proportional  to  the  resulting  y;.)  If  the  estimates  of  wt  and  T,  were  exact,  then  the 
distribution  cost  of  yf,  j  €  Tk>  would  be 

f>,°-  2>,r,v°. 

Tk 

Thus,  set 

h 

and  use  this  in  conjunction  with  the  constants  determined  as  in  [17]  to  define  yk  and  8*.  With 
these  values  we  can  define  aggregate  shortage/disposal  costs: 

(8)  aflyf)  -  -  y(y(  +  (y£  ~  W  ft  F^)d(. 

h 

Finally,  use  the  same  *,  to  construct  aggregate  distribution  costs,  as  in  fixed-weight  disaggrega 
tion,  i.e., 

c,l  -  Y.WJC‘J‘  '  €  l-  k  6  K 
h 

Several  remarks  on  this  approximation  scheme  are  in  order.  First,  the  definition  (8) 
implies  that  the  aggregate  problem  is  a  TPUD  with  the  same  distributions  of  aggregate  demand 
F?  as  result  from  fixed-weight  disaggregation;  only  the  constants  differ.  Second,  as  shown  in 
[171,  even  though  (7)  is  derived  by  fitting  certain  properties  of  qk  at  one  or  two  given  values  of 
y£,  the  approximation  is  close  globally,  in  a  certain  sense.  Third,  the  approximation  is  nearly 
exact  if  each  of  the  sets  (>,  -  T,  :  ;  €  7"*}  and  {y,  -  8,  :  j  €  Tk)  has  nearly  identical  elements. 
Fourth,  if  the  assumption  that  dl  yf  >  0  should  happen  to  be  false,  the  approximation  still  has 
some  desirable  properties,  though  they  are  weaker  than  the  above.  Fifth,  the  approximation 
method  (but  not  the  results  above  for  fixed-weight  disaggregation)  can  he  readily  extended  to  a 
case  more  general  than  (Al)  which  includes  all  normal  distributions.  Specifically,  assume 
Fj(( )  -  //*!(£  -  Hj)/(Tj]  for  positive  constants  Hj  and  <xy,  j  6  Tk.  (If  we  choose  Hk  with 
zero  mean  and  unit  variance,  of  course,  n,  is  the  mean  of  /},  and  o,  is  its  standard  deviation.) 
Then,  (7)  still  provides  a  good  approximation  of  qk,  in  the  sense  of  [17],  where  Ff  is  redefined 
as  Ft(Z)  -  Hk[(Z  -  fik)/(Tk],  and 

T, 

The  approximation  (7)  does  not  necessarily  result  in  an  upper  bound  on  qk ,  so  no  particu¬ 
lar  relationship  between  z°  and  z*  can  be  assumed. 
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4.  ESTIMATING  AGGREGATE  DEMANDS 

Define  Gk  to  be  the  actual  distribution  of  the  sum  of  the  demands  at  destinations 
j  €  Tk,  k  €  K. 

Suppose  one  formulated  a  TPUD  from  scratch  at  the  aggregate  level,  that  is,  with  destina¬ 
tions  k  €  K.  It  would  be  natural  to  define  the  procurement/disposal  cost  functions  a*1  in  terms 
of  the  distributions  Gk.  Similarly,  if  one  were  trying  to  estimate  demands  for  an  aggregate-level 
CTP,  one  would  normally  seek  to  estimate  the  true  total  demands.  From  a  Bayesian  viewpoint, 
such  estimation  would  be  based  on  the  functions  Gk. 

In  either  interpretation,  smaller  correlations  among  the  individual  demands  would  be 
reflected  in  a  smaller  variance  for  Gk.  This  fact  expresses  the  Bayesian  version  of  the  standard 
classical-statistical  (and  intuitive)  result  that  an  aggregate  index  is  "easier"  to  estimate  than  its 
individual  components.  (The  Fs  and  the  Gk  can  be  interpreted  as  a  posteriori  distributions  based 
on  the  same  data  set.)  Statements  in  the  literature  (e.g.,  (7],  (14])  implying  that  aggregation 
reduces  data  requirements  (and,  we  suspect,  much  current  practice)  reflect  the  same  idea. 

In  the  analysis  of  the  previous  section,  however,  it  was  shown  that  Fk  is  the  appropriate 
distribution  to  use  at  the  aggregate  level  under  both  fixed-weight  and  optimal  disaggregation, 
while  Gk  is  identical  to  Ff  only  if  demands  are  in  fact  perfectly  correlated.  Otherwise,  while 
they  have  the  same  mean,  Fk  has  a  larger  variance.  That  analysis  thus  appears  to  conflict  with 
the  observations  above.  As  we  shall  now  discuss,  this  conflict  reveals  the  limitations  of  our 
analysis,  but  it  also  suggests  that  the" natural"  use  of  Gk  at  the  aggregate  level  is  often  inappropriate. 

The  mechanics  of  aggregation  and  disaggregation  bear  a  strong  resemblance  to  a  partially 
centralized  distribution  system.  In  the  latter,  typically,  yk°  would  be  sent  first  to  a  central  ware¬ 
house,  and  only  then  divided  up  among  the  individual  destinations  in  Tk.  The  most  important 
advantage  of  such  a  system,  of  course,  is  that  some  of  the  distribution  decisions  can  be  post¬ 
poned  until  the  individual  demands  are  partially  realized;  or,  in  other  words,  the  uncertainties 
of  the  individual  demands  can  be  pooled  and  their  costs  thus  reduced.  See,  for  example, 
Eppen’s  recent  study  of  the  effects  of  centralization  on  newsboy  problems  (1], 

The  point  is  that  no  such  centralization  occurs  in  the  scenarios  analyzed  above.  The 
results  of  aggregation  and  disaggregation  (even  optimal  disaggregation)  pertain  to  a  situation 
modeled  by  problem  (1);  in  particular,  all  distribution  decisions  must  be  implemented  before 
demand  is  realized.  Aggregation  is  an  operation  performed  on  a  model ,  and  unless  it  is  accom¬ 
panied  by  actual  centralization  in  practice ,  the  advantages  of  centralization  mentioned  above  are 
not  realized.  In  short,  aggregation  and  centralization  must  be  carefully  distinguished,  and  the 
"conflict"  above  results  from  confusing  the  two. 

A  similar  conclusion  holds  if  we  put  aside  the  Bayesian  framework  and  consider  the  more 
familiar  problem  of  collecting  data  to  (classically)  estimate  demands  for  use  in  a  CTP.  The 
standard  result  concerning  aggregate  demands  mentioned  above  can  be  phrased  as  follows: 
Fewer  observations  are  required  to  estimate  total  demand  with  a  given  level  of  precision  than  to 
estimate  the  individual  demands  with  the  same  level  of  precision. 

This  argument  fails  in  the  present  case  precisely  because  of  the  need  for  disaggregation, 
which  requires  estimates  at  some  level  of  precision  of  the  individual  demands.  Indeed,  any 
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disaggregation  method  will  equate  total  shipments  to  individual  destinations  and  their  respective 
estimated  demands,  so  it  is  the  precision  of  the  latter  estimates  that  is  important;  the  precision 
of  the  estimate  of  total  demand  is  irrelevant.  In  sum,  the  use  of  an  aggregate  model  per  se 
without  centralization  of  the  system,  does  not  reduce  data  requirements  at  all. 

It  is  no  doubt  often  true  that  a  relatively  simple  aggregate  level  model  is  used  "for  plan¬ 
ning  purposes"  in  situations  where  in  fact  some  centralization  is  possible.  The  analysis  above 
does  not  apply  to  such  cases.  As  is  well  known,  to  model  any  such  centralized  distribution  sys- 
um  requires  a  multi-stage  stochastic  program.  (See,  e.g.,  [9]  and  the  references  therein.)  An 
an  lysis  of  the  effects  of  aggregation  on  such  models  would  be  most  interesting,  but  is  outside 
thi  scope  of  this  paper. 

5.  BOUNDS 

A  Posteriori  Bound  Suppose  the  aggregate  problem  (4)  is  defined  according  to  fixed-weight 
disaggregation,  and  then  solved.  We  now  derive  an  a  posteriori  bound  on  z°  —  r*,  using  the 
solution  of  (4),  but  not  that  of  (1).  First,  the  following  relations  are  straightforward: 

-°  -  £W0  +  2»C5)  + 

•  j  j 

u  j 

0  2  z*ris,  -  2>*|  +  2>;>. 

i  i  j  i  j  i 

Combining  these  expressions,  and  rearranging  terms,  we  obtain 
z°  -  z*  <  £W°  -  2>tr/<0 

i  i 

+  +  p*<5)  ~  ci?xo 

ij 

+  +  ZaM0) 

J  i 

-  2»*o)  -  !«/>;>• 

i  j 

The  first  two  terms  cancel.  The  next  term  is  less  than  or  equal  to 

IW(?>  +  dj)  -  I +  dj)y], 
j  i  J 

where  dj  -  max, {it/40  -  cy},  j  €  J.  Thus, 

(9)  z°-z*<  +  I,aM) 

J  i 

+  W,  - 1^0-;). 

j  r 

so 

(10)  *°  -  z"  <  2Kp*<5>  +  <*/ O',0)) 

j 

-  £  min  {- djyj  +  a,(yy)  :  y;  >  0). 

j 
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This  is  our  bound.  The  solution  of  the  .Ah  minimization  is  finite,  since 
rr/10  -  c,j  <  —c,j  <  -6 J,  i  €  /, 
so 

y,  +  dj  <  -  8r 

The  minimum  is  achieved  at 

inf  (y,  :  Fjiyj)  >  (y,  +  tf;)/(y,  -  8,).  y,  >  0}, 
so  the  bound  requires  relatively  little  extra  computation. 

Under  the  second  form  of  optimal  disaggregation,  suppose  that  our  approximation  of  qk  is 
exact  at  the  optimum,  and  in  particular,  the  Wj  used  to  construct  the  aggregate  objective  func¬ 
tion  are  exactly  correct.  Then,  the  expression  above  for  z°  holds,  so  the  bound  (10)  applies  in 
this  case  also.  Even  when  these  conditions  cannot  be  ensured,  computation  of  the  right-hand- 
side  of  (10)  may  provide  a  useful,  if  approximate,  measure  of  suboptimality. 

A  Priori  Bound.  We  now  derive  a  bound  which  does  not  require  problem  (4)  to  have  been 
solved,  assuming  (Al)  and  (A2).  For  this  purpose  we  first  require  a  priori  upper  and  lower 
bounds  on  y*  itself: 

First,  y*  is  certainly  no  greater  than  would  be  the  case  if  we  could  ship  without  restriction 
from  the  source  having  minimal  shipping  cost  to  j.  Also,  y*  is  limited  by  total  supplies.  Thus 
y*  <  y/  where  y*  is  the  smaller  of  J\v,  and 

I 

inf  {y,  :  y,  ^  0.  F,(yj)  >  =  (y,  -  minjq,})/^  -  8;)}. 

To  obtain  a  lower  bound,  let  c+  =  max(y!cy)  and  let 

y,~  -  inf  {y,  :  Fj(yj)  2*  (y,  -  c+)/(y;  -  8,),  y,  >  0}. 

Then,  y~  is  a  lower  bound  on  y*,  //the  sum  of  the  y~  thus  computed  is  no  greater  than  J  s,. 

i 

Actually,  these  lower  bounds  can  be  considerably  improved,  if  we  can  determine  a  priori  that 
certain  x*  -  0.  Letting  5  be  the  set  of  (/',  j)  which  cannot  be  so  eliminated,  redefine 
c+  -  max  (c0  :  (i,  j)  €  5).  Again  the  yf  derived  from  this  c+  are  valid  lower  bounds,  subject 
to  the  condition  mentioned. 


Proceeding  to  the  bound  on  z°  -  z*,  lei  us  rewrite  (9)  as  follows: 


(11) 


-*•<  Z 


-hj  -  Pk(°j)bj°  +  (y/  -  s j)  / '  Fj(£)d( 


-  Z \-(yj  +  dj)yJ+  (y,  -  6j)  /_>,(*) <4 


The  first  summation  equals 


(12) 


Z! 

k 


+  vA0 

-iyi  ~  pC°W°  +  hk  -  S  L 
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But,  under  (Al)  and  (A2),  for  each  k, 

„yA0  yAO  y  o 

S.l  w#  -  i%/_i  wo#  - 1/_:  wdt, 

Tk  Tk 

so  (12)  equals 

(13)  I  -<y*%  -  oftM0  +  <y&i  ~  8*o, >  /_£  • 

j 

Moreover, 

f>,°>  -  /?o,  0$, ).  y  €  / 

so  in  fact  y,°  minimizes  the  yth  term  in  (13).  Thus,  if  we  substitute  y*  for  yf  in  (13),  the  ine¬ 
quality  still  holds.  Also, 

Vy  +  «!i“  V,  “  P*®>  +  max,{p*o)  4-  irf°  -  c(>) 

<  -  Pk(°j ,  +  . 

where  e,  -  max,{c^(y)  -  c,;),  by  the  optimality  conditions  for  the  aggregate  problem.  In  sum, 
from  (11)  we  obtain 

51  {— (y**o»  ~  y j  ~  ej)yl 

+  Ky*V,>  -  Vy)  -  (8*o>  -  8y)l  SI 
so 

(14)  z°  -  r*  <  £  max  j— (y^(>,  -  y,  -  t’yb'y 

+  (( yf<y>  -  yy)  -  (8*o*  -  Sy )  1  /_*  F,  (*)«/*  -.yr<yj<  y? ) 

Expression  (14)  is  the  o  priori  bound.  In  the  ^h  maximand,  if  the  term  in  square  brackets 
is  negative,  the  maximand  is  concave,  and  the  maximum  is  easily  computed.  If  this  term  is 
nonnegative,  the  maximand  is  convex,  and  the  maximum  is  achieved  at  one  of  the  endpoints. 

A  qualitative  analysis  of  (14)  suggests  a  criterion  for  aggregation  which  is  consistent  with 
earlier  work  on  deterministic  problems  [5,  13]:  Suppose  the  costs  {c,,  :  j  €  7"J  are  nearly  ident¬ 
ical  for  each  /,  and  that  each  set  { y }  :  j  €  Tk )  and  {8,  :  j  6  Tk)  has  nearly  identical  elements  as 
well.  Then,  each  constant  in  parentheses  in  (14)  is  close  to  zero  for  j  €  Tk,  so  these  terms 
contribute  little  to  the  overall  bound.  This  observation  suggests  aggregating  destinations  whose 
corresponding  cost  coefficients  are  all  similar.  The  size  of  the  demands  at  the  respective  desti¬ 
nations  appears  to  be  irrelevant. 

A  Posteriori  Bound  for  Optimal  Disaggregation,  First  Form:  Williams  [12]  derives  a  bound  on 
the  suboptimality  in  (1)  of  any  y,  together  with  the  optimal  solution  of  the  CTP  with  demands 
y.  Clearly,  this  result  can  be  applied  to  (x°,  y°),  derived  from  the  solution  to  (5)  by  the  first 
form  of  optimal  disaggregation  described  above.  Let  z'  be  the  total  cost  of  this  solution,  and 
(«•',  p ')  the  optimal  dual  solution  of  the  CTP.  Then,  the  bound  is  as  follows: 
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-<>,  -  pPW  -  y'j)  +  (>j  -  8;)  /;  f/f 

where  y,'  -  inf  (y;  :  y,  >  0,  Fjiyj)  -  (y ,  -  p/)/(y;  -  8,)},  j  €  /  Note  the  similarity  in  form 
(and  hence  computational  effort)  of  (15)  and  the  other  bounds  derived  above.  Under  assump¬ 
tion  d,  a  weaker  bound,  which  is  even  simpler  to  compute  however,  is  also  due  to  Williams 
[12]: 

(16)  z'  -  z*  <  £(y,  -  8 j)  ( yf  -  y'j)  [Fj {yf)  -  F>  { y J)  1 . 

j 

It  would  be  useful  to  be  able  to  compute  an  a  priori  bound  on  z*.  Since  z'  <  z°,  the 
bound  (14)  applies  to  z'  -  z*.  We  have  not  found  a  stronger  such  result. 
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ABSTRACT 

This  paper  studies  certain  "second  order"  parametric  relations  in  capacitated 
transportation  problems.  These  relations  concern  the  question  of  what  happens 
to  the  effect  of  a  parameter  (first  derivative)  as  another  parameter  is  varied. 
These  relationships  have  been  found  quite  useful  in  the  solution  of  many  types 
of  facility  location  and  capacity  expansion  problems.  The  paper  presents  several 
results  on  the  parametric  behavior  of  the  dual  multipliers  from  which  second 
order  parametric  relations  can  be  derived. 


1.  INTRODUCTION 

This  paper  is  concerned  with  the  study  of  certain  "second  order"  parametric  relations  in 
capacitated  transportation  problems: 

(P):  Min  Z  —  £  c0  x0 

i.J 

( 1 )  Subject  to  ]£  Xjj  <  5,  /  —  1  ....  n 

j 

(2)  £  Jty  -  Dj  j  -  1  ....  n 

i 

(3)  Ljj  <  x,j  <  M,j  i  —  1  ....  n  j  —  1  . . .  ,  m. 

In  the  above  formulation  ctJ  denotes  unit  shipping  costs;  S,  and  Dj  are  the  magnitudes  of 
availabilities  and  requirements,  respectively;  and  and  MtJ  O  0)  represent  bounds  on  the 
shipments,  XtJ.  Specification  of  the  way  in  which  Z  changes  with  respect  to  the  constants  of 
(P)  is  commonly  known  as  parametric  analysis.  To  discuss  different  types  of  parametric  rela¬ 
tionships  let  a*  stand  for  any  one  of  the  constants  (data)  of  (P)  which  is  being  parameterized, 
and  Z (ak)  be  the  optimal  cost  function.  A  common  way  to  generate  this  function  is  to  evalu¬ 
ate  dZ/dak  for  various  ak  levels  while  holding  all  other  data  fixed.  We  characterize  this  well 
known  relationship  between  ak  and  Z  (ak)  as  a  simple  parametric  relation.  If  we  now  allow 
another  problem  data  to  vary  also,  and  denote  its  various  levels  by  ar,  we  may  define  a  second- 
order  parametric  relationship  (SOPR,  henceforth)  as  the  specification  of  how  a  change  in  ar 
affects  the  function  Z(ak)  over  its  entire  domain.  The  "shift"  in  Ziak)  due  to  a  change  in  ar 
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may  be  specified  through  the  derivative  of  bZ/bak  with  respect  to  i.e.,  through 
b7Z/bak  bar.  The  well-known  result  that  Z(ak)  is  convex  in  ak  is  an  example  of  a  SOPR,  in 

the  above  sense,  since  the  convexity  of  2  (a*)  is  equivalent  to  the  statement:  — r  <  0  V  ak. 

bat 

However,  this  paper  considers  more  general  relationships  in  the  form  of  b2Z/bak  bar  where 
k  ^  r,  from  which  further  useful  results  may  be  derived. 


We  were  not  able  to  locate  many  papers  which  directly  address  the  SOPR  in  transportation 
problems.  Erlenkotter  14]  gives  several  specific  SOPR  results  in  the  uncapacitated  transporta¬ 
tion  problem.  We  will  discuss  some  of  these  results  in  Section  3.  Another  relevant  work  is  by 
Shapley  [10]  who  studied  similar  relationships  occuring  in  assignment  problems. 

The  ability  to  judge,  without  re-solving  the  problem,  whether  the  changes  in  two  (or 
more)  parameters  reinforce,  dampen  or  leave  unchanged  one  another’s  individual  effects  may  be 
quite  useful  in  post-optimality  analyses  of  ( P ).  As  we  shall  demonstrate  later,  such  interactive 
effects  can  be  derived  from  the  SOPR  in  a  straightforward  way.  An  even  more  important  use 
of  these  interactive  effects  can  be  made  in  the  study  of  large  problems  in  which  some  version 
of  ( P )  appears  as  a  sub-problem.  For  example,  in  determining  and  optimal  capacity  expansion 
schedule  for  a  multi-plant  system  (4,5,6),  it  is  imperative  to  know  how  a  change  in  the  capacity 
of  one  source  affects  the  potential  benefits  of  expanding  the  capacities  of  some  others.  A  simi¬ 
lar  situation  exists  in  solving  the  capacitated  facility  location  problems  [1,8,9],  One  needs  to 
take  into  account  the  fact  that  potential  benefits  of  adding  new  facilities  or  dropping  existing 
ones  depend  on  the  current  set  of  facilities. 

In  the  next  section  we  introduce  some  definitions  and  preliminaries  on  the  structure  of 
(P).  Section  3  gives  several  specific  SOPR  and  shows  how  these  may  be  used  to  determine  the 
interactive  effects  of  simultaneous  parameter  changes. 

2.  PRELIMINARIES 


Since  bZ(ak)/bak  can  be  interpreted  as  a  dual  multiplier,  a  SOPR  can  be  studied  from 
the  parametric  behavior  of  the  dual  multipliers.  For  instance,  if  Sk  and  Sr  are  the  two  problem 
data  in  (P)  for  which  a  SOPR  is  to  be  studied,  and  if  uk  denotes  the  dual  multiplier  associated 

a  t  a? 

with  the  Jc,h  supply  constraint,  the  SOPR  of  interest  is  simply  the  function  — —  — £-|  or 

bSr  |  bSk 

equivalently  buk/bSr. 


In  order  to  facilitate  the  analysis  of  the  behavior  of  the  dual  multipliers,  some  well  known 
graph  theoric  properties  of  the  solutions  to  (P)  and  some  definitions  are  given  below: 


Let  B  be  a  set  of  arcs  (y)  constituting  a  feasible  basis  for  (P).  B  and  the  corresponding 
flows  x,j  are  optimal  to  (P)  if  nonnegative  dual  multipliers  u,  and  vy  can  be  found  respectively 
for  Equations  (1)  and  (2)  of  (P)  satisfying  (3,  p.  377]: 

(4)  u,>  0  —  £  -  Si 

j 

(5)  v,  -  u,  >  ctJ  —  x,j  -  MfJ 

(6)  v,  -  u,  <  cu  —  x„  -  L0 

(7)  v,.  -  u,  -  c„  —  L,j  x,j  Mir 
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Arcs  satisfying  (5)  and  (6)  are  nonbasic  at  their  upper  and  lower  bounds  respectively  while  those 
satisfying  (7)  are  basic.  Given  the  dual  multipliers  u,  and  vy,  the  dual  multipliers  t0  and  l0 
associated  with  restrictions  x0  <  M0  and  x0  >  Lu  are  implicit  in  (4)-(7)  and  can  be  obtained 
as: 

t,j  -  Max  {-( t,j ,  0} 

I,,  -  Max  | <ru,  0) 
where  -  c0  +  «,  -  vy. 

If  B  is  a  basis,  then  its  columns  form  a  spanning  forest  consisting  of  spanning  trees  each 
corresponding  to  nonbinding  Equation  (2)  [7],  However  if  a  set  of  slack  variables  is  associated 
with  a  dummy,  ( m  +  l)sl  sink,  then  each  basis  of  (P)  corresponds  to  a  unique  spanning  tree 
which  connects  every  node  (supply  and  demand  points)  to  every  other  node  without  any  cycles. 
Every  nonbasic  arc  can  be  represented  in  terms  of  the  tree  arcs  by  using  a  (+1,  -1,  0)  valued 
vector.  An  exchange  of  a  basic  and  a  nonbasic  arc  yields  an  adjacent  basis,  i.e.,  for  ( k,r )  €  B° 
and  (4.  jb)  9  B°,  B]  —  B°  -  {(£,r)|  U  ((4,  j6)j  is  an  adjacent  basis  if  it  is  a  basis.  With 
respect  to  two  adjacent  bases  B°  and  B\  define  two  arc  sets: 

B+  -  B°  U  B' 

B-  -  B°  n  B'. 

The  union  of  the  two  bases,  B*  has  a  unique  cycle  of  arcs  denoted  by  C  whose  member 
arcs  are  numbered  sequentially  beginning  with  the  leaving  arc  (r,k).  The  intersection  of  the 
two  bases,  ET  divides  the  nodes  into  two  groups  (as  shown  by  the  vertical  bar  in  Figure  1): 
Those  nodes  which  are  connected  to  node  r  by  some  arcs  in  B~:  and  those  which  are  connected 
to  node  k  by  some  arcs  in  B~ .  These  node  sets  are  denoted  by  /Vrand  Nk  respectively. 


- -  B1 

Figure  la.  —  The  incoming  arc  is  even  numbered 
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- .8o 

- Bi 


Figure  lb  —  The  incoming  arc  is  odd  numbered 


3.  PARAMETRIC  RELATIONSHIPS 

The  first  part  of  this  section  will  examine  the  parametric  behavior  of  the  dual  multipliers 
with  respect  to  a  problem  constant.  We  will,  then,  show  how  these  results  may  be  used  to 
derive  the  interactive  effects  when  certain  problem  data  are  parameterized  simultaneously. 

THEOREM  1.  When  an  arc  capacity,  say  Mrk ,  is  reduced,  then: 

(i)  /,,,  for  all  y,  and  ilk  for  all  /cannot  decrease; 

(ii)  lrj,  for  all  j ',  and  llk  for  all  /cannot  increase. 

PROOF:  Since  any  magnitude  of  change  can  be  considered  as  a  sum  of  smaller  changes, 
it  is  sufficient  to  prove  the  result  for  some  A  (a  reduction  in  Mrk)  which  is  of  sufficient  magni¬ 
tude  to  require  exactly  one  basis  change  for  restoring  optimality.  Reducing  Mrk  by  A  renders 
the  current  optimal  solution  x,®  infeasible.  To  restore  optimality  the  dual  simplex  algorithm 
removes  arc,  (r,k)  and  locates  a  new  arc,  (4,  jb)  to  enter  the  basis.  The  choice  of  A  and  the 
entry  criterion  of  the  dual  simplex  algorithm  guarantee  optimal  primal  and  dual  solutions.  The 
primal  solution,  x<J  corresponding  to  the  new  basis  B]  is: 

x,®  +  A  for  (ij)  €  C,  and  even  numbered 

x,)  -  x,®  -  A  for  ( i.j )  €  C,  and  odd  numbered 

x,®  for  (i.j)  9  C 

as  can  be  seen  from  Figure  la  or  lb. 
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CASE  (a):  If  the  incoming  arc  is  an  odd  numbered  member  of  C,  then  and 

<  0  (see  Fig.  la).  To  satisfy  (7)  for  (4,  jb),  cribJb  —  c,^,  +  u,b  -  v>6  must  be  increased  to 
0.  If  (m  +  1)  €  N',  then  u,b  must  increase,  otherwise  v>4  should  decrease  to  increase  and 
preserve  (4).  We  may  assume  u,b  increases  by  8  -  —a lhJ  without  loss  of  generality  for  if  (4)  is 
violated  by  this,  (8)  can  be  subtracted  from  all  u,  and  v;  to  restore  (4)  leaving  all  t:J  and  l(J 
intact.  Furthermore,  to  maintain  (7)  on  the  other  basic  arcs,  the  multipliers  of  all  the  nodes  in 
A*  must  increase  by  8  since  4  €  A7*. 

CASE  (b):  If  (4.  jb)  is  an  even  numbered  arc  of  C  then  xfo  -  Z.,.^  and  0 >  0  (see 
Fig.  lb).  To  satisfy  (7)  for  (4,  jh).  vt  must  increase  by  8  -  <r,  y  (again  assume  (4)  remains 
valid  with  no  loss  of  generality).  Also  to  maintain  (7)  for  the  other  basic  arcs,  the  multipliers 
of  all  the  nodes  in  Nk  must  increase  since  jb  6  Nk. 

fu,°  +  8  ,  €  /V\  K°  +  8  yf  Arlt, 

“  jii,0  /  €  N'\  and  V'  ~  |v°  j  €  AT. 

Consequently,  the  definitions  of  /(J  and  /„  lead  to: 


[Max 

1  -<r° 

+  8.0}^  /° 

j 

6  Nk 

In  ~ 

I'J 

j 

6  Nr 

\ 

8? 

i 

€  A'* 

U  ” 

Max 

l-*2 

+  8.0}^  ft 

i 

€  Nr 

In  ~ 

[Max 

- 

8,  0}  ^  ft 

J  e 

Nk 

I/O 

\'n 

J  € 

Nr 

•  I  1 

1,1 

/  6 

Nk 

U  “  1 

Max 

W  - 

•  8.  0)  $  ft 

;  € 

Nr 

THEOREM  2:  If  a  lower  bound,  Lrk  is  reduced,  then 

(i)  trr  for  all  j,  and  ilk  for  all  i  cannot  increase; 

(ii)  lrJ  for  all ./',  and  l,k  for  all  (cannot  decrease. 

PROOF.  Using  the  same  argument  as  before,  we  conclude  that  when  the  flow  is  forced 
up  on  arc  ( r .  k)  it  leaves  the  basis  by  reaching  its  upper  bound  and  either  an  even  numbered 
arc  increases  its  flow  beyond  its  lower  bound  or  an  odd  numbered  arc  reduces  its  flow  below  its 
upper  bound  to  become  basic.  In  either  case,  i  €  N'  and  v/t  j  €  N'  increase  by  6. 
Definitions  of  r,y  and  l0  lead  to: 


(  (Max  (-a-0*  -  8,  0}  <  ft  j  €  Nk 
*’>  ~  |f°  J  €  Nr 

I*  i  6  Nk 

“  Max  {-o-a  -  8,  0)  <  ft  if  (V' 
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Max  ( a n  +  8, 

0)  >  if, 

J  €  Nk 

J  €  N' 

[42 

j  6  Nk 

[Max  | crlk  +  8, 

0}  ^  /,? 

j  €  Nr 

The  above  results  establish  that  r(J  and  /„  are  monotonic  in  Mrk  and  Lrk ,  if  either  /  =  ror 
/  =  A.  Multipliers  r,;,  /„.  m,  and  v(  may  not  be  monotonic  for  some  /  ^  r  and  some  y  ^  k 
since  at  successive  iterations  of  the  dual  simplex  algorithm,  /  and  ./may  not  consistently  belong 
to  the  same  sets  Nr  or  Nk  and  hence  u,  or  v,  may  increase,  causing  and  /„  to  vacillate,  at 
different  iterations. 

We  now  show  that  a  result  due  to  Erlenkotter  [4)  can  be  viewed  as  special  cases  of 
Theorems  1  and  2.  This  result  concerns  the  behavior  of  dual  multipliers  for  the  uncapacitated 
problem  when  the  supply  and  demand  figures  are  altered. 

THEOREM  3:  (Erlenkotter  [4)1  The  optimal  dual  multipliers  u,  and  v,  for  a  transporta¬ 
tion  problem  UP)  without  Equation  (3))  are  nondecreasing,  if  either. 

(a)  D,  are  constant,  and  .S',  decrease,  or 

(b)  S'  are  constant,  and  D,  increase. 

PROOF:  To  prove  (:>  •  t  ./  =  m  +  1  represent  the  slack  column  of  the  transportation 
problem.  The  definition  .„  leads  to  lim+\  =  un  reducing  Sk  can  be  viewed  as  increasing 
Lk  m.|.  From  Theorem  2  tne  result  follows. 

To  prove  < b >  let  /  -  n  +  1  be  art  artificial  source  with  c„+1  ,  ==  0,  M„+]  ,  *=  0  Vj,  and 
S„.|  =  °°.  This  modified  problem  is  equivalent  to  the  original  one.  The  definition  of  r,y  leads 
to  |  ,  =  v;.  Reducing  Dk  can  be  viewed  as  increasing  M„+\.k-  From  Theorem  1  the  result 
follows. 

This  proof  also  extends  Erlenkotter's  14)  results  to  the  capacitated  transportation  prob¬ 
lems. 


Using  the  same  reasoning  employed  in  Theorems  1  and  2  one  can  study  the  behavior  of 
i  and  /„  with  respect  to  changes  in  crk.  Decreases  in  crk  can  be  viewed  as  increases  in  Mrk 
since  in  either  case  the  condition  which  triggers  adjustments  in  the  primal  and  dual  solutions  is 
\k  -  ur  >  crk  and  xrk  <  Mrk  (violation  of  (5)).  Conversely,  increases  in  crk  can  be  considered 
as  reductions  in  Lrk  since  in  either'case  \k  -  ur  <  crk  and  xrk  >  Lrk  (violation  of  (6)). 

Having  established  the  behavior  of  the  duals,  second  order  parametric  properties  can  be 
derived.  We  will  give  such  a  derivation  using  a  procedure  first  employed  by  Erlenkotter  [4]  to 
prove  that  Z(S,)  —  Z(S,  +  a)  is  nonincreasing  in  another  capacity,  5,  for  a  ^  0.  Theorem  4 
establishes,  as  an  example  of  derivation  of  interactive  effects,  the  behavior  of  the  cost 
difference  in  (P)  due  to  changing  the  capacity  of  one  arc,  as  a  function  of  a  second  arc  capacity. 
Notation  Zia.b )  and  r„  ( a.b )  denotes  the  optimal  cost  of  (P)  and  the  value  of  /„  when  the 
capacities  of  the  two  arcs  are  a  and  b  respectively. 
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THEOREM  4:  The  increase  in  Z( ,  .  .)  due  10  lightening  xrk  <  Mrk  is  nondecreasing  as 
(i)  xlk  <  M,k  for  any  /  or  (ii)  x„  <  M„  for  any  ./  is  tightened;  or  equivalently  for  any  given  a 
and  /3  >  0, 

(i)  Z ( M,k  -  «.Mr, )  ~  Z(Mrk.Mri)  ZKM,k  -  a,MrJ  ~  p)  ~  Z(Mrk.Mrj  -  p) 

(ii)  Z(Mrk  -  a.Mlk )  -  Z(M,k.Mlk >  «£  Z(A/r/>  -  a,M,k  -  p)  -  Z(Mrk.M,k  -  /3). 

PROOF:  We  will  only  prove  (i)  since  the  proof  for  (ii)  is  almost  identical.  From  the  well 
known  properties  of  the  duals  we  have. 

bZ 

bM„ 

Therefore, 

Z{M,k  -  a,  M,t)  =  Z(M,k,  M„)  +JJ  irk  ( M,k  -  8,  MrJ)dh 
Z(Mrk.  M„  -  p)  =  Z(Mtk.  M„)  +  J*  tn(Mrk,  M„  -  8) db 
and 

Z  ( Mrk  -  a,  Mn  -  p)  =  Z  ( Mrk .  A/r/)  +  fj  U  (Kk  ~  8.  Mn)d8 

+  J7  'r,(Mrk-a.  Mr,  -8) <18. 

Substituting  these  expressions  in  the  statement  of  (i)  we  get. 

/;  U,<A/M  -  «.  A/„  -  8)  -  t„(Mr,.  A/,,  -  8 > ] c/S  >  0. 

This  is  true,  since  by  Theorem  1  we  have 

tr,(Mrk.  Mr,  -  8)  tr,  ( Mrk  -  «.  Mr/  -  8)  for  0  8  p. 


A  corollary  of  the  above  has  been  applied  to  devise  pruning  tests  in  a  branch  and  bound 
algorithm  for  a  multi-activity  facility  location  problem  [2], 

COROLLARY:  The  cost  increase  due  to  deleting  a  subset  A  possibly  all)  of  the  arcs 
emanating  from  a  source  r  is  no  smaller  than  the  sum  of  the  cost  increases  associated  with  dis¬ 
joint  subsets,  A i  of  A.  i.e., 

Z[P\Mr,  =  0,  j  €  A\  -  Z(P|  >  £  {Z{P\Mr,  =  0.  j  <E  A,]  -  Z{P}} 

i 

where  notation  Z\P\  A)  denotes  the  optimal  cost  of  problem  (P)  when  condition  A  is  true. 

PROOF:  Reducing  Mn.  itA,^  renders  the  penalties  due  to  reducing  M,,.  jfAu  higher 
from  Theorem  4.  The  result  directly  follows. 

An  important  conclusion  of  this  paper  is  that  one  can  analyze  the  magnitude  of  cost 
difference  for  any  two  values  of  some  parameter  as  a  function  of  another  by  studying  the 
parametric  behavior  of  the  dual  multipliers.  Consequently,  general  interactive  effects  of  param¬ 
eters  can  be  determined  to  the  extent  that  the  parametric  behavior  of  the  relevant  multipliers 
are  monotonic. 
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ABSTRACT 

The  joint  problems  of  determining  the  optimal  plant  location  and  optimal 
input  mix  and  plant  size  are  addressed.  The  interrelationship  between  input 
substitutability  and  plant  location  is  stressed.  Conditions  under  which  the  loca¬ 
tion  problem  can  be  separated  from  the  determination  of  the  optimal  input  mix 
are  developed  for  a  number  of  problem  variations.  The  stability  of  the  optimal 
location  in  the  face  of  changes  in  problem  parameters  is  also  discussed.  It  is 
demonstrated  that  consideration  of  input  substitutability  often  makes  the  result¬ 
ing  problem  no  more  difficult  to  solve  than  problem  formulations  in  which  the 
inherent  input  substitutability  is  ignored. 


INTRODUCTION 

The  joint  problems  facing  a  profit  maximizing  firm  of  determining  its  optimal  plant  loca¬ 
tion  and  the  optimal  input  mix  and  plant  size  have  historically  been  treated  separately.  There 
have  been  a  few  significant  exceptions  where  the  two  problems  have  been  treated  simultane¬ 
ously,  at  least  for  special  cases.  Perhaps  the  seminal  effort  is  that  of  L.  Moses  U4],  followed 
several  years  later  by  the  efforts  of  A.  P.  Hurter  and  R.  E.  Wendell  [7],  A.  Goldman  [6],  and 
most  recently  be  A.  P.  Hurter,  J.  S.  Martinich,  and  E.  R.  Venta  [8].  Other  contributions  to  an 
overall  understanding  of  production-location  problems  include  (but  are  not  limited  to)  the 
works  of  M.  Bradfield  [1],  R.  S.  Woodward  [22],  D.  L.  Emerson  [4],  A.  Khalili,  V.  K.  Mathur 
and  D.  Bodenhom  [9],  V.  K.  Mathur  [12]  and  S.  M.  Miller  and  0.  W.  Jensen  [13].  Here,  these 
results  are  unified  and  extended  to  the  case  in  which  initially  a  single  plant  is  to  be  located  on 
the  plane  and  different  transportation  structures  are  employed.  Later,  a  multiplant  model  is 
investigated.  The  usual  assumption  of  a  known  quantity  of  product  demanded  is  retained  for 
each  commodity.  Demand  considerations  within  a  production-location  problem  context  are 
dealt  with  elsewhere  by  E.  R.  Venta  and  A.  P.  Hurter  [19]. 
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A  SINGLE  PRODUCT  SINGLE  PLANT  PROBLEM  -  GENERAL  FORMULATION 

The  firm  is  assumed  to  seek  the  plant  location  and  the  mix  of  inputs  to  utilize  in  order  to 
minimize  the  costs  of  supplying  known  demands  at  each  of  several  markets.  The  inputs  are 
sold  at  fixed  prices  at  their  sources  and  are  transported  to  the  plant,  thereby  incurring  transport 
costs.  The  output  of  the  plant  is  shipped  to  markets  of  known  locations,  and  in  some  instances 
the  firm  incurs  transport  costs  for  the  product.  The  firm  is  constrained  by  its  available  produc¬ 
tion  technology,  which  is  summarized  in  the  form  of  a  production  function.  It  is  assumed  that 
the  production  function  can  be  represented  by  a  homothetic  function.* 

The  general  problem  under  consideration  is  represented  formally  in  PI  below: 

0.0)  min  £  Cp(x,  bm,  qm)  +  £  C(x,  aj,  Zj) 

m  J 

(PI) 

(1.1)  subject  to  £  qm  -  f(z) 

m 

where: 

x  A  the  vector  variable  denoting  the  location  of  the  plant  in  R2 
Zj  A  the  quantity  of  the  Ah  input  used,  j  —  1,  . . .  ,  J  -  a  variable 
qm  A  the  known  quantity  of  the  product  demanded  at  market  center  m,  m  —  1 ,  . . . ,  M 
bm  A  the  known  vector  location  of  the  mth  market  center  in  R2 

Cp(x,  bm,  q„)  A  the  cost  of  transporting  qm  units  of  the  product  from  location  xto  location  bm 

cij  A  the  known  vector  location  in  R1  of  the  source  of  supply  for  the  Ah  input  y  —  1.  ....  J 

C(x,  aj%  Zj)  A  the  cost  purchasing  Zj  units  of  input  j  at  location  a,  and  of  transporting  these 
units  from  location  a,  to  location  x 

/(•)  A  the  known  homothetic  production  function 

In  the  following  section,  several  versions  of  PI  will  be  discussed.  Each  version  will  intro¬ 
duce  specific  assumptions  about  the  form  of  the  objective  function  (1.0)  and/or  the  production 
function  (1.1). 

A  SINGLE  PRODUCT  PROBLEM  WITH  SPECIAL  TRANSPORT  COSTS 

Consider  a  simplified  version  of  PI  in  which  the  cost  of  transporting  the  product  to 
market  is  negligible.  In  this  case,  the  Cp(x,  bm,  qm)  terms  in  (1.0)  will  be  zero.  Assume  that 
the  costs  of  transporting  inputs  are  constant  on  a  per  unit  of  input  per  unit  of  distance  basis. 
Then  problem  PI  with  these  assumptions  may  be  written  as  problem  P2  below: 

(2.0)  min  £(/j  d(x,  a,)  +  pj)  Zj 

j 


’Notice  that  this  formulation  allows  for  the  possibility  of  more  then  one  supplier  for  any  input.  For  example,  In  a  two- 
input  Cobb- Douglas  technology 

/(*)  -  A  (i||  +  r,j)*(r2)<l_,) 

with  r  j  |  and  2|}  as  quantities  of  input  one  supplied  from  alternate  sources. 
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subject  to  z0  -  f(z) 

where: 

Pj  A  the  price  per  unit  of  input  j  at  its  source  as 

d(x,  <tj)  A  the  "distance"  between  locations  ay  and  x 

Tj  A  the  cost  of  transporting  a  unit  of  input  J  a  unit  of  distance 

z0  A  £  qm 

m 

A.  P.  Hurter  and  R.  E.  Wendell  (7]  introduced  Lagrangian  optimization  to  the  analysis  of 
this  problem  and  demonstrated  that,  with  a  certain  class  of  production  functions,  me  could  first 
solve  for  the  optimal  plant  location,  x,  and  then  for  the  optimal  input  mix,  z,  without  fear  of 
sub-optimization.  A.  Goldman  (61  later  identified  the  class  of  production  functions  used  by 
Hurter  and  Wendell  as  the  class  of  all  homothetic  production  functions.  Goldman  also  demon¬ 
strated  that  solution  algorithms  that  have  been  developed  for  the  Weber  location  problem  (i.e., 
P2  with  the  z,  all  fixed)  can  be  used  to  solve  for  the  optimal  location  in  P2.  Hurter,  Martinich 
and  Venta  [81  have  demonstrated  that  the  optimal  location  is  invariant  with  parametric  changes 
in  z0. 

A  more  convenient  form  of  the  problem,  which  allows  a  more  direct  exploitation  of  the 
properties  of  homothetic  production  function,  is  now  developed  and  used  throughout  the 
remainder  of  the  paper. 

Define: 

C(x,  zq)  -  min  J  (ryrf(x,  a})  +  pj)  Zj 

j 

subject  to  Zq  -  fiz) 

C(x,  zq)  is  then  the  minimum  cost  of  producing  z0  units  of  the  product  at  location  x 
C(x,  z0)  will  be  referred  to  as  the  "cost  function." 

LEMMA  1:  Let  /be  a  homothetic  production  function. 

Then: 

(*)  C(x,  zo)  -  h(x)  g(zo) 

PROOF:  By  definition,  a  homothetic  function  /(z)  can  be  written  as  /(z)  -  F(/f(z)), 
where  H  is  homogeneous  of  degree  one  and  F  is  an  increasing,  differentiable  transformation. 

Let  h(x)  -  min  £  (rjcHx,  ay)  +  pj)  z} 

j 

subject  to  1  -  H(z ) 

Since  H  is  homogeneous  of  degree  1,  the  minimum  cost  of  producing  z0  units  at  x  using  tech¬ 
nology  H is  Cf/ix,  zo)  -  h(x)z0.  But 

H(z)  -  F"1  (z0)  A  g(r«>).  thus  Cix,  z0)  -  Mx)  *(*<>)•  O 
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Lemma  1  is  a  spatial  version  of  Shephard’s  [7]  duality  theorem  for  homothetic  production 
functions. 

THEOREM  1 :  If  the  production  function  in  P2  is  homothetic,  then  a  single  optimal  loca¬ 
tion  exists  for  all  output  rates.* 

PROOF:  Using  Lemma  1  (P2)  can  be  rewritten  as  min  h(x)  g(z0)  which  implies  that 
solving  min  h(x)  will  yield  the  optimal  location.  □ 

As  the  result  of  Theorem  1,  P2  can  be  solved  directly.  As  a  first  step,  the  form  of  the 
marginal  cost  function,  h(x ),  must  be  determined,  this  function  is  minimized  xith  respect  to 
the  vector  location  variable,  x  The  solution,  x*,  is  optimal  for  all  levels  of  output.  Solving  P2 
with  x  —  x*will  yield  the  optimal  mix.t 

Embedded  within  this  solution  procedure  is  the  solution  of  a  location  problem  min  h(x). 
As  in  the  ordinary  or  Weber  location  problem,  the  solution  scheme  must  be  iterative  in  nature. 
The  basic  solution  scheme  for  the  Weber  problem  was  developed  by  L.  Cooper  [3]  and, 
independently,  by  H.  W.  Kuhn  and  R.  E.  Kuenne  [10]  for  convex  objective  functions.  A. 
Goldman  [6],  in  addition  to  pointing  out  that  h(x)  generally  is  not  convex,  extended  the 
scheme  developed  by  Kuhn  and  Kuenne  [10]  to  account  for  local  minima  at  singular  points 
(i.e.,  the  sources  of  the  inputs),  and  provided  a  converging  fixed  point  iterative  algorithm  to 
find  local  minima  at  other  than  singular  points.  It  appears  then  that  solving  min  h(x)  is  not 
significantly  more  difficult  then  solving  the  Weber  problem. 

Next  consider  a  case  in  which  the  firm  must  pay  for  the  transport  of  its  product  to  market, 
so  that  the  Cp(x,  bm,  q„)  terms  of  PI  are  no  longer  zero  as  they  were  in  P2.  Instead,  let 
Cp(x,  bm,  qm)  -  rd(x,  bm)qm.  Then,  PI  can  be  written: 

(3.0)  min  ]£  rd(x,  bm)  qm  4  £(r,rf(x,  dj)  4  Pj))zj 

m  J 

(P3) 

(3.1)  subject  to  ^  qm  -  z0  -  f(z) 

m 

Making  use  of  Lemma  1,  P3  can  be  rewritten  as 

(3.2)  min  T  r  d(x,  bm )  qn  4  h(x )  g(r0) 

*  m 

The  objective  (3.2)  contains  only  the  location  variable  xand  thus,  as  in  the  case  of  problem  P2, 
it  establishes  a  separability  between  production  and  location  decisions  so  that  first  a  problem 
involving  only  the  location  variable  can  be  solved  and  subsequently  the  optimal  input  mix  can 
be  determined.  However,  in  problem  P3  the  objective  (3.2)  includes  both  the  level  of  output, 
z0,  and  the  demands  at  each  market,  qm.  In  general,  the  optimal  location  will  change  as  the  qm 
and/or  z0  change. 


Theorem  l  *u  proved  in  18]  in  *  different  form  end  is  included  here  for  completeness. 

'the  explicit  determination  of  h(x)  is  necessary  for  the  solution  procedure  described.  Wxl  has  been  derived  explicitly 
for  Cobb-Douflas  functions  in  [7]  and  for  CES  functions  in  [6].  Goldman  [(]  also  gives  general  conditions  for  deter¬ 
mining  h(x).  However,  even  if  h(x)  cannot  be  determined  explicitly,  the  characterisation  of  the  cost  function  given  in 
Lemma  1  is  useful  in  developing  properties  of  the  location  decisions. 
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If  the  production  function,  /,  is  homogeneous  of  degree  one,  then  g(z0)  and  z0  are  the 
same  in  the  preceding  formulation  and  (3.2)  can  be  written  as: 

(3.3)  min  £(r  d(x,  bm)  +  A(x))  qm. 

m 

The  optimal  location  is  now  insensitive  to  the  level  of  output  as  long  as  demands  increase  or 
decrease  proportionally  over  all  markets  (ie.,  as  long  as  qjqt  is  constant  for  all 
i,  j  «■  1,  ....  m).  Note  that  in  the  case  of  only  one  market  the  same  location  is  optimal  for  all 
output  levels.  To  summarize: 

THEOREM  2:  If  the  production  function  in  P3  is  homothetic,  then  the  production- 
location  problem  can  be  written  as  a  problem  in  the  location  variable  only  given  by  (3.2). 
Further,  if  the  production  function  is  homogeneous  of  degree  one  and  any  change  in  output 
level  is  distributed  proportionally  to  all  markets,  then  a  single  location  will  be  optimal  for  all 
output  levels. 

EXTENSION  OF  THE  SINGLE  PLANT  PROBLEM 

Consider  problem  P2  with  nonlinear  (in  distance)  transportation  costs,  so  that  terms  of 
the  form  r}  d(x,  av)  are  replaced  by  7}(d  (x,  ay))  where  Tj  is  a  general  transport  rate  whose 
value  depends  upon  the  distance  traveled.  Then  the  problem  takes  the  form: 

(4.0)  min  £  [Tj(d(x.  ay))  +  pj]zj 

J 

'”4) 

(4.1)  subject  to  z0  -  /(z) 

The  form  of  this  problem  is  similar  to  that  encountered  earlier  and  Lemma  1  applies,  yielding  a 
cost  function  in  the  form  of  (*).  Consequently,  Theorem  1  applies.  As  the  form  of 
Tjldix,  ay)]  increases  in  complexity,  the  difficulty  encountered  in  solving  the  location  problem 
analogous  to  min  h(x)  also  increases.  Conceptually,  at  least,  the  same  general  computational 
scheme  can  be  utilized.  When  linear  costs  for  the  distribution  of  product  are  considered,  along 
with  nonlinear  input  transport  costs,  Theorem  2  also  applies. 

Under  some  circumstances,  the  prices  paid  for  inputs  at  their  sources  may  depend  upon 
the  quantity  of  input  purchased,  and  then  problem  P2  can  be  written: 

(5.0)  min  £  (#>/*/)  +  rj  d{x,  <tj))  z, 

j 

(P5) 

(5.1)  subject  to  zo-/(z). 

For  a  fixed  but  arbitrary  location,  x,  consider  the  minimization  over  the  vector  of  input  vari¬ 
ables,  z.  The  first  order  conditions  are: 

(5.2)  id  pjd  Zj)  zj  +  pj(zj)  +  rj  d(x,  aj)  -  fi(x)  (d//8zy), 

where  the  asterisk  denotes  the  optimal  value  and  p.  is  the  multiplier.  Multiplying  (5.2)  by  zj, 
rearranging  and  summing  over  J  yields: 

(5.3)  T  (pj  (z;)  +  rj  d(x,  aj ))  z?  -  p(x)  T  (8//8zy)  zj  -  T(d  pjd  iy)  zf. 

j  T  j 

! 

1 

i 
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Notice  that  the  left-hand  side  of  (S.3)  has  the  same  form  as  (5.0)  and  therefore  the  right-hand 
side  of  (5.3)  is  an  equivalent  form  of  the  objective  function  for  P5.  This  suggests  the  following 
theorem. 

THEOREM  3:  With  respect  to  problem  P5,  suppose  the  following  assumptions  hold: 

(a)  /is  homothetic 

(b)  the  input  price  functions  Pj(zj)  all  have  the  property  that  (d  pjd  Zj)  z}  —  con¬ 
stant  -  Cj  for  all  values  of  Zj  in  the  domain  of  Pj(zj). 

Then: 

(1)  The  production-location  problem  can  be  written  as  a  problem  in  the  location  vari¬ 
able  only. 

(2)  A  single  optimal  location  exists  for  all  levels  of  output, 

(3)  When  linear  costs  of  transporting  output  are  included,  as  in  P3,  the  separability  of 
production  and  location  problems  holds,  but  the  optimal  location  changes  with  the 
level  of  output,  and 

(4)  When  linear  costs  of  transporting  output  arc  included,  as  in  P3,  there  is  a  single 
optimum  location  if  the  production  function  is  homogeneous  of  degree  one  and 
the  change  in  output  occurs  proportionally  over  all  markets. 

PROOF:  A.  J.  Goldman  [6]  has  shown  that,  if  /  is  homothetic,  then 
£  ( df/dzj )  Zj  -  K(z0)  for  all  z  such  that  z0  -  f(z).  This  and  the  second  assumption  allow 

reduction  of  the  right-hand  side  of  (5.3)  to: 

(5.4)  min  ft(x)  K(z0)  +  C  =  min  n(x), 

X  X 

where  XC,  A  C.  Thus,  results  (1)  and  (2)  are  proven.  Results  (3)  and  (4)  also  follow  directly 

j  ” 

from  the  arguments  used  in  developing  Theorem  2.  □ 

A  question  of  interest  is  the  applicability  of  the  second  assumption  in  the  statement  of 
Theorem  3.  The  differential  equation 

(5.5)  ( d  pj/d  Zj)  z}  -  Cj 

has  the  solution: 

(5.6)  PjKZj)  -  Cij/zj  +  c2j  or  pj(zj)  Zj  -  cXj  +  c2j  Zj. 

This  suggests  that  the  cost  of  purchasing  some  positive  quantity  of  input  (zj  >  0)  can  be 
divided  into  a  fixed  cost  cXj  associated  with  any  purchase  independent  of  size  and  a  variable 
cost  cjj  for  every  unit  purchased.  This  is  not  an  usual  scheme  for  setting  a  price.  In  addition, 
if  the  input  demand  function  can  be  soiled  so  that  cy  —  0,  then  Pj(zj)  “  c\j!*j  or 
Pjizj)  Zj  -  cXj,  a  constant  elasticity  demand  function.  Over  all,  (5.6)  can  be  interpreted  as  a 
pricing  mechanism  that  offers  quantity  discounts  in  terms  of  a  fixed  cost  and  a  constant  unit 
cost. 
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Now  consider  P2  with  transportation  costs  that  depend  on  the  quantity  shipped.  Then  P2 
takes  the  form: 

(6.0)  min  £  ( pj  zj  +  7}  (zj.  x)) 

j 

(P6) 

(6.1)  subject  to  20  -  f(z) 

where  Tj(zj,  x)  is  the  cost  of  transporting  zj  units  from  the  source  at  a,  to  the  plant  at  x  For 
arbitrary  fixed  x,  the  first  order  conditions  are: 

(6.2)  Pj  +  d TJbzj  « p. (x)  (Bf/dzj)  j-  1 . J 

Multiplying  (6.2)  by  z],  summing  over  all  j,  rearranging  and  adding  £  Tj(.zj,  x )  to  both  sides, 

j 

yields: 

(6.3)  T(PJZJ+  TJ(zJ'  J^(BfldZj)z*j  -  XteTj/dZj)  z]  +  £7}(r*,  x). 

j  j  J  J 

The  left-hand  side  of  (6.3)  is  similar  to  (6.0);  so  the  right-hand  side  of  (6.3)  is  an  equivalent 
form  of  the  objective  function  for  problem  P6.  Since  /  is  homothetic,  the  right-hand  side  of 

(6.3)  can  be  made  independent  of  z*  if  it  is  further  assumed  that: 

(6.4)  -  ( dTj/dzj )  zj  +  Tj  (zm,  x)  -  kj  for  ally  —  1,  ....  J  where 

the  kj  are  constant.  Under  these  conditions,  statements  (1),  (2),  (3),  and  (4)  of  Theorem  3 
apply  and  the  proofs  are  identical. 

The  assumption  (6.4)  is  analogous  to  that  in  statement  (b)  of  Theorem  3.  The  equation 
(6.4)  has  a  solution  of  the  form  T  -  r,  r  +  r0  where  ri  and  t0  are  constants,  and  an  interpreta¬ 
tion  of  Tjizj,  x)  in  terms  of  a  fixed  cost  plus  a  constant  unit  cost  multiplied  by  a  quantity  tran¬ 
sported  is  suggested.  It  is  not  unusual  to  find  that  transportation  costs  per  unit  distance  can  be 
approximated  by  a  fixed  cost  per  shipment  regardless  of  quantity  shipped  plus  a  cost  depending 
linearly  on  the  quantity  to  be  shipped. 

A  MULTIPLANT,  SINGLE  PRODUCT  PROBLEM  -  GENERAL  FORMULATION 

When  the  firm  operates  more  than  one  plant,  the  complexity  of  the  problem  is  substan¬ 
tially  increased.  Now,  the  firm  must  determine  the  location  of  each  plant,  the  total  output  pro¬ 
duced  at  each  plant,  the  allocation  of  that  output  to  markets,  and  the  input  mix  to  be  used  at 
each  plant.  The  multiplant  version  of  PI  can  be  written: 

(7.0)  min  £  £  C(,(x„  bm,  q,m )  +  £  £  C(x(,  aJt  zj) 

ml  I  J 

(P7) 

(7.1)  subject  to  £  -  /,( r t),  i  -  1,  ....  / 

m 

(7-2)  “  Qm,  f»  "  1 . M 

I 

where: 


x,  A  vector  location  of  the  /  th  plant,  i  -  1 . / 
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zj  ^quantity  of  the  fih  input  used  at  plant  i,  j  «  1,  . . .  ,  J 

qlm  A  the  amount  of  the  single  output  shipped  from  plant  i  to  the  mth  demand  center, 

“  /  —  1,  ....  /,  m  —  1,  ....  M. 

Although  /,  the  number  of  plants,  has  been  treated  as  fixed  in  the  formulation  of  P7,  it 
should  be  interpreted  as  the  maximum  number  of  plants  to  be  used,  since  the  optimal  strategy 
can  result  in  some  plant(s)  operating  at  a  zero  level  of  output. 

Problem  P7  embodies  a  very  difficult  problem  which  might  be  called  a  production- 
location-allocation  problem.  The  location-allocation  problem  has  received  considerable  atten¬ 
tion  in  the  literature  with  varying  degrees  of  success.*  No  attempt  is  made  in  this  paper  to  pro¬ 
pose  new  solution  methods  for  the  general  location-allocation  problem,  although  an  algorithm  is 
developed  for  a  special  version  (P8  below).  Here,  the  interactions  between  the  production, 
location,  and  allocation  decisions  are  explored. 

THE  SINGLE  MARKET  CASE 

Consider  a  multiplant  firm  which  serves  only  one  market.  Assume  first  that  the  firm  does 
not  incur  the  costs  of  the  distribution  of  output.  Then  the  Cp  terms  of  (7.0)  are  zero.  In  addi¬ 
tion,  if  the  input  transport  costs  are  linear  in  distance  and  quantity,  then  P7  becomes: 

(8.0)  min  £  £  ( tj  d(xt,  aj)  +  Pj)  zj 

i  J 

(P8) 

(8.1)  subject  to  zi)  -  f,(z,)  /  -  1.  ....  / 

P8,  if  the  output  levels  at  each  plant  z(>  are  known,  reduces  to  the  consideration  of  several 
independent  single  plant  problems  like  P2,  since  there  is  no  interaction  between  the  plants.  A 
more  interesting  case  arises  when  the  total  quantity  of  product  required  is  known  but  the  levels 
of  production  at  each  plant  are  to  be  determined  optimally.  Thus,  consider  P8  with  the  addi¬ 
tional  restriction 

(8.2)  D  -  £  zi 

/ 

with  D  known  but  z &  variable. 

Problem  (P8)  with  (8.2)  is  a  version  of  what  might  be  called  a  production-location- 
allocation  problem.  (P7  is  a  more  complex  version).  Location-allocation  problems  are  fre¬ 
quently  found  in  the  literature  where,  generally,  iterative  algorithms  have  been  developed  for 
their  solution.  In  this  case,  Lemma  1  provides  a  form  for  the  cost  function  which  does  not 
involve  the  zj  variables,  and  so  P8  with  (8.2)  is  reduced  to  a  location-allocation  problem.  Since 
a  single  set  of  optimal  locations  exists  for  all  output  rates  (i.e.,  Theorem  1  applies,  following 
Lemma  1),  the  location-allocation  (of  D  to  the  zfj)  problem  can  be  separated  into  two  problems, 
as  illustrated  below  in  terms  of  a  conceptual  one  pass  solution  algorithm: 

1.  Compute  optimal  locations,  xf,  by  solving  min  h,(xj)  for  each  L 

* i 


•For  works  on  location-allocation  pro  We  ms  consult  the  excellent  bibliography  by  R.  Francis  and  J.  Goldstein  15). 


NAVAL  RESEARCH  LOGISTICS  QUARTERLY 


VOL.  29,  NO.  2,  JUNE  19*2 


PRODUCTION-LOCATION  PROBLEMS 


287 


2.  Since  location  decisions  are  not  affected  by  changes  in  output  level,  the  solution 
of: 

min  £  h^xfl  g/izjj) 

4  i 

subject  to  £  r6  -  Z> 

i 

provides  the  optimal  allocation  of  D  to  the  z(y 

3.  To  obtain  the  optimal  input  mix  for  each  plant,  solve: 

min  £  (r;  d(x*.  ay)  +  pj)  zj 

j 

subject  to  zf>*—  /,(z0. 


This  algorithm  has  been  called  a  conceptual  algorithm  because,  as  in  the  single  plant  case, 
explicitly  obtaining  the  separability  of  the  production  and  location  problems  is  not  always  possi¬ 
ble. 


In  the  important  special  case  of  a  production  function  homogeneous  of  degree  one,  a 
characterization  of  the  optimal  solution  can  be  obtained  without  explicitly  finding  h(x).  After 
optimal  locations  have  been  found  in  Step  1 ,  the  allocation  problem  can  be  solved  on  a  per  unit 
basis  and  the  problem  of  Step  2  reduces  to: 

min  T  C,  zj 
4  ' 

subject  to  £  zj  -  D\  z&  >  0;  i  -  1 . / 

i 

where  C,  is  the  cost  of  production  and  transportation  per  unit  of  output  at  plant  /  located  at  x* 
determined  in  Step  1.  Hence,  the  problem  in  step  2  becomes  a  linear  program  with  one  con¬ 
straint,  suggesting  that  the  optimal  solution  will  contain  one  z(>  value  equal  to  D  (for  some  /) 
and  the  remaining  zq  values  equal  to  zero  (k  *  i).  Consequently,  the  optimal  solution  to  P8 
with  (8.2)  when  each  /,(z 0  is  homogeneous  of  degree  one,  can  be  expected  to  have  a  single 
plant  operating  at  a  positive  level. 

Returning  to  P8,  suppose  the  firm  incurs  costs  for  distribution  of  product  which  are  linear 
in  distance  and  quantity  shipped.  Then  P7  becomes 

(9.0)  min  £  r  q,  d (x,f  b)  +  II  (tj  d(xt,  ctj)  +  pj)  zj 

*f  *}■  i,  i  i  i 

(P9) 

(9.1)  subject  to  q,  «  /,(z0  f-  1 . / 

(9.2)  2>-Z> 

i 

Using  Lemma  I,  P9  is  written: 

(10.0)  min  £  r  q,  d(x„  b)  +  J  ht  (x,)  g,  (g() 

*!■  *<  I  I 
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(P10) 

(10.1)  subject  to  £  q,  -  D. 

i 

Consequently,  Theorem  2  applies  to  P10  and,  conceptually  at  least,  separability  between  the 
production  and  location  aspects  of  the  problem  can  be  attained. 

The  form  of  (10.0)  suggests  little  about  the  effect  of  changes  in  the  output  level  on  the 
optimal  location.  In  general,  the  locations  will  change  with  a  change  in  the  quantity  demanded 
since  not  only  will  each  plant’s  output  change,  but  the  proportion  of  the  demand  produced  by 
individual  plants  may  change  as  well.  It  the  production  functions  all  exhibit  constant  returns  to 
scale,  then  (10.0)  can  be  rewritten: 

(10.2)  min  T  ^(min  r  d(xt,  b )  +  ht(x,)) 

«/  i  *i 

which  involves  the  solution  of  a  series  of  single  plant  location  problems,  one  for  each  plant, 
and  then  selection  of  the  single  plant  for  which  d(x *  b )  +  h,  (xp  is  minimum  to  supply  all  of 
D. 


Next  consider  the  case  of  nonlinear  (with  quantity)  transport  costs  for  the  distribution  of 
product.  Now  P10  becomes: 

(11.0)  min  £  C,(.q,)  d(x b)  +  £  h,(xt)  g,(q,) 

*>•  *  t 

(pin 

(11.1)  subject  to  £  -  D. 

i 

There  is  little  that  can  be  said  about  Pll  in  general.  A  simplification  can  be  obtained  in  the 
case  where  C,(q,)  ■»  gt(qP  wherein  (11.0)  becomes: 

(11.2)  min  T  gt  (qt)  (min  d(xt,  b )  +  Af0t,)). 

«i  T  *, 

The  portion  of  (11.2)  within  parenthesis  is  a  problem  in  the  location  variables,  x„  only,  and 
consequently  the  location  problem  is  separable  from  the  allocation  and  production  problems. 
The  function  C,(qf)  represents  economies  or  diseconomies  of  scale  (with  respect  to  quantity)  in 
transportation;  while  g,(q,)  represents  a  measure  of  economies  of  scale  in  production.  When 
these  economies  of  scale  are  the  same,  (11.0)  can  be  written  as  (11.2).  g,(qf)  has  inverse  rela¬ 
tionship  with  the  usual  interpretation  of  economies  of  scale  in  production.  For  example,  let 
/(z)  -  F(H(z))  -  (//(z))*  with  fi  >  1.  Then,  *,(?,)  -  r'(H(z))  -  q,u'%  so  that 
d2  g,(q,)/d  q2  <  0.  Consequently,  the  assumption  C^q,)  -  g,(q,)  will  hold  if  the  economies  of 
scale  in  production  are  just  "cancelled  out"  by  diseconomies  of  scale  in  the  transportation  of 
output  or  vice  versa. 

THE  MULTIPLE  MARKET  CASE 

Now,  consider  a  problem  analogous  to  P9  for  the  case  of  multiple  markets.  The  location 
and  input  mix  of  each  plant  must  be  determined,  as  well  as  the  proportion  of  the  known  total 
demand  at  each  market  that  will  be  satisfied  by  each  plant.  The  problem  is: 

(12.0)  min  £  £  r  qm  d(x„  bm)  +  £  £  (ry  d(x„  aj)  +  pj)  zj 

«*■  i  *  i  ) 
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subject  to  £  qim  -  f(z').  i  -  1 . / 

m 

02.2)  £  -  Qm  «  -  1 . Af 

i 

Once  again  utilizing  Lemma  1,  (12.0)  can  be  rewritten  as: 

02.3)  min  £  £  r  qim  d(xit  bm )  +  £  h,  Or,)  g,  (q,). 

*r  1m  i  m  i 

This  is  a  location-allocation  problem  in  which  the  costs  of  production  are  incorporated  in  the 
objective  function.  In  fact,  the  allocation  portion  is  similar  to  a  transportation  problem.  Thus, 
a  rather  general  production-location  allocation  problem  has  been  reduced  through  the  homothe- 
ticity  assumption  and  Lemma  1  to  a  form  of  the  location-transportation  problem.  Solution 
algorithms  exist  for  such  problems,  at  least  when  the  h,(x,)  and  £,(?,)  functions  are  linear. 

CONCLUDING  COMMENTS 

The  results  presented  in  this  paper  demonstrate  that  including  consideration  of  input  sub¬ 
stitutability  through  the  use  of  production  functions  often  does  not  significantly  increase  the 
complexities  encountered  in  solving  for  the  optimal  plant  location (s),  optimal  input  mix,  and 
optimal  allocation  of  product  to  customer.  The  location  problem  to  be  solved  is  more  complex 
than  the  location  problem  which  results  when  the  potential  for  input  substitution  is  ignored. 
However,  algorithms  similar  to  those  developed  for  traditional  location  problems  can  be 
employed.  The  results  derived  in  this  paper  depend  upon  the  assumption  that  the  production 
function  summarizing  the  appropriate  technology  can  be  reasonably  approximated  by  a 
homothetic  function.  Many  theoretical  or  engineering  based  production  functions  and  most 
empirical  production  functions  can  be  adequately  represented  by  homothetic  functions.* 
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ABSTRACT 

Simple  direct  smoothing  formulas  are  derived  for  updating  coefficient  esti¬ 
mates  and  forecasts  in  a  discounted  least  squares  model.  These  formulas  are 
the  natural  extensions  of  R.  G.  Brown's  well-known  smoothing  formulas  to  a 
general  econometric  setting  with  arbitrary  explanatory  time  series.  The  recur¬ 
sive  updating  process  and  its  forecast  error  properties  are  illustrated  via  a  sim¬ 
ple.  yet  realistic  numerical  example. 


1.  INTRODUCTION 

Discounted  least  squares  (DLS)  is  a  generalization  of  the  familiar  ordinary  least  squares 
(OLS)  model  of  econometrics.  DLS  retains  the  basic  OLS  assumption  of  uncorrelated 
homoscedastic  disturbances,  but  it  introduces  artificial  heteroscedasticity  by  discounting  past 
squared  deviations  in  the  least  squares  minimization.  In  effect,  the  procedure  simply  gives 
more  weight  to  the  most  recent  observations.  OLS  is  a  special  case  of  DLS  where  the  discount 
factor  is  equal  to  one. 

Despite  the  ostensible  artificiality  of  DLS,  the  technique  actually  provides  considerable 
flexibility  in  model  (coefficient)  updating.  The  discount  factor  can  be  considered  as  a  con¬ 
venient  lever  to  adjust  the  dynamism  of  the  updating  process.  As  a  practical  matter,  the  tech¬ 
nique  works  well  in  some  real-world,  short-term  forecasting  applications,  in  the  sense  that  it 
improves  on  OLS  and  compares  favorably  with  other  generalized  techniques. 

Although  DLS  doesn't  seem  to  have  generated  much  interest  among  econometricians,  it 
does  form  the  basis  for  exponential  smoothing  models.  These  smoothing  models  relate  a 
dependent  series  to  specified  regular  functions  of  time,  and  they  yield  neat,  direct  steady-state 
formulas  for  simultaneous  coefficient  updating  and  origin  translating.  This  steady-state  updat¬ 
ing  technique  was  originated  by  Brown  111;  a  more  modem,  accessible  reference  is  Montgomery 
and  Johnson  [3].  Brown’s  specialized  version  of  DLS  is  widely  known  and  used,  but  to  our 
knowledge  his  approach  hasn’t  been  extended  explicitly  to  a  more  general  setting  with  arbitrary 
explanatory  time  series,  although  the  concept  of  recursive  updating  is  not  new  (see  Odell  and 
Lewis  [41).  The  objective  of  this  paper  is  to  explicitly  extend  Brown’s  formulas  to  a  general 
econometric  setting  and  to  illustrate  their  potential  practical  value  via  a  simple,  yet  realistic 
example. 
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DLS  is,  of  course,  a  special  case  of  generalized  least  squares  (GLS),  and  hence  the  solu¬ 
tion  of  a  DLS  model  is  conceptually  routine,  in  that  the  normal  equations  are  straightforwardly 
constructed  and  solved  via  matrix  inversion.  Unlike  arbitrary  GLS  models,  however,  the  DLS 
normal  equations  can  be  reduced  to  a  simple  direct  form  for  coefficient  updating,  and  matrix 
inversion  an  be  avoided  after  initialization.  We  derive  these  direct  "smoothing”  formulas  in 
Section  3  after  reviewing  the  general  DLS  structure  in  Section  2.  Section  4  reviews  Brown’s 
specialized  version  of  DLS  and  provides  a  conceptual  link  to  his  steady-state  smoothing  formu¬ 
las.  In  Section  S,  we  illustrate  the  use  of  DLS  in  a  simple  econometric  model;  although  the 
particular  model  is  somewhat  crude,  it  provides  us  with  a  device  to  benchmark  DLS  forecast 
error  performance.  Section  6  elaborates  on  the  key  issue  of  selecting  a  specific  value  for  the 
discount  factor. 

2.  THE  DISCOUNTED  LEAST  SQUARES  MODEL 

Let  y(l),  . ..  ,  y(t)  be  observations  of  a  time  series  for  periods  1,  ,  r;  form  the  /- 

dimensional  column  vector  y  (/)  -  (y(l),  ....  y(t))'.  Let  x(l),  ...  ,  x(t)  be  a  time  series  of 
explanatory  //-dimensional  column  vectors  for  periods  1,  . ../;  form  the  ixn  matrix 
X(r)  -  tx(l) . x(r)]'.  Then,  we  have  the  usual  linear  model  formulation 

(1)  y(/)-X(/)0  +  u(/) 

where  u(r)  -  (u(l) . u(t))'  is  the  /-dimensional  disturbance  vector  and  P  is  the  unknown 

//-dimensional  coefficient  vector.  The  coefficient  vector  P  is  assumed  to  be  time  invariant. 

As  usual,  we  assume 

(2)  £(u(/))  -  0 
E(u(/)u(/)')  -  <r2I(/) 

where  o-2  is  an  unknown  scalar,  0  is  the  /-dimensional  zero  column  vector,  and  I(r)  is  the  /  x  / 
identity  matrix.  Rather  than  forming  the  usual  (OLS)  normal  equations,  however,  we  choose 
to  minimize  the  discounted  sum-of-squares  (y (/)  -  X(/)b(/))’W(/)(y(/)  -  X(/)b(/))  where 

W (/)  -  diag{8'-1 . 8.  1}  is  a  diagonal  /  x  /  weight  matrix  incorporating  the  discount  factor 

8  €  (0.  1].  This  minimization  yields  the  DLS  normal  equations 

(3)  X(t)HKi)XU)bU)  -  X(/)'W(/)y(/) 

where  b(/)  is  the  DLS  estimate  of  p  at  period  i  Equation  (3)  can  be  written  equivalently 

(4)  b(/)  -  S(/)"'v(/) 

where  v(/)  -  X(/)'W(/)y(/)  and  S(/)  -  X(/)'W(/)X(/)  is  assumed  nonsingular. 

The  effect  of  the  weight  matrix  in  the  discounted  sum-of-squares  minimization  is  to  put 
more  emphasis  on  the  most  recent  residuals.  What  we  are  doing  is  compensating  for 
deficiencies  in  the  static  linear  model  (1);  i.e.,  we  don’t  really  expect  p  to  be  time  invariant  so 
we  are  placing  greater  weight  on  the  most  recent  observations. 

Given  that  we  have  solved  the  normal  equations  (4)  for  b(/),  we  denote  our  forecast  of 
y(t  +  k ),  the  observation  //periods  hence,  by 

(5)  ykU  +  *)  -  x(/  +  *)*(/) 
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assuming  x(t  +  k)  is  known;  in  most  econometric  situations,  of  course,  x(r  +  k)  is  itself  a 
forecast  derived  by  some  other  means.  Equation  (S)  can  be  written  equivalently 

(6)  ykU)  “  *(/)!*(/  -  k ) 

for  t  >  k,  and  we  define  the  ^-period  forecast  error  by 

(7)  ek(t)  -  y(t)  -  yk0)- 

We  will  also  refer  in  Sections  S  and  6  to  the  absolute  error  ak(t)  —  |e*(/)|  and  the  absolute 
percentage  error  pk(t)  -  100 ak(t)/yk(t). 

This  section  has  outlined  the  basic  assumptions  and  mechanics  of  DLS.  From  here  on, 
our  focus  becomes  computational.  Our  goal  is  to  put  the  elements  of  equation  (4)  into  a  recur¬ 
sive  format,  which  is  why  we  have  explicitly  attached  the  time  index  to  all  of  our  variables. 

3.  RECURSIVE  SMOOTHING  FORMULAS 


To  develop  recursive  formulas,  we  first  partition  X(r),  y(r),  and  W(r)  as  follows. 
X(/)'-  [X(r  -  1)'.  x (/) ) 
y(/)'  -  ly(r  -  1)’,  >-(r)l 


W(r)  — 


8W(r  -  0,0 

O',  1 


where  the  variables  indexed  by  t  -  1  are  defined  as  in  Section  2  over  the  first  t  -  1  observa¬ 
tions  and  0  is  the  (t  -  1) -dimensional  zero  column  vector. 


Ordinary  matrix  multiplication  yields  the  following  recursive  formulas  for  S(r)  and  v(/). 

(8)  S(r)  -  XOmOXO) 

-  8X(r  -  lmr  -  l)X(r  -  0  +  x(/)x(f)' 

-  8S(r  -  1)  +  x(/)x(r)' 

(9)  v(r)  -  XU)’WU)j(t) 

-  8X0  -  1)^(1  -  l)y(f  -  1)  +  xU)y(t) 

-  8v(r  -  1)  +  x(/)y(r). 

Equations  (8)  and  (9)  are  the  basic  smoothing  formulas  for  the  elements  of  (4).  However,  the 
smoothing  is  still  indirect  and  calculation  of  the  coefficient  estimate  vector  b(r)  in  (4)  still 
requires  matrix  inversion  at  each  update. 


To  obtain  a  direct  smoothing  formula,  we  note  that 

(8S(r  -  I)  +  x(r)x(r)')b(r)  -  6 v(r  -  1)  +  xU)y(t) 

-  8S(r  -  l)b(r  -  1)  +  x(r)  (et(r)  +  x(z)*b(f  -  D) 
and  upon  rearranging  terms  that 

(10)  S(r)(b(r)-b(r- l))-x(r)e,(f) 

Mr)  -  b(r  -  I)  4-  *(r)e|(r) 
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where  i(r)  —  S(r)_lx(r).  Equation  (10)  is  a  direct  smoothing  formula  for  updating  Mr), 
although  the  calculation  still  involves  matrix  inversion  at  every  stage. 

Now  (following  Rao  [$],  problem  2.8,  p.  33), 

S(r)->  (8S(r  -  1)  +  x(t)x(t)')  -  I 
where  I  is  the  n  x  « identity  matrix.  Hence, 

8S(r)-‘S(r  -  1)  -  I  -  S(r)-*x(r)x(r)' 

-  I  -  i (t)x(t)' 

and  upon  post-multiplying  each  side  by  S(r  -  l)-1,  we  get 

(11)  S(/)-‘  -  8"1  (I  -  i(r)x(r)')S(r  -  l)-1 

so  that  S(r)~!  is  obtainable  directly  from  SO  -  l)-1  via  simple  matrix  multiplication  once  x(r) 
is  known.  To  get  a  formula  for  tO)  in  terms  of  SO  -  l)-1,  we  post-multiply  both  sides  of 

(11)  by  x(r)  to  obtain 

iO)  -  8— *  (I  -  iO)xO)')SO  -  l  )_,x(r) 
and  upon  rearranging  terms  we  get 

(12)  iO)  -  (8  +  xO)*SO  -  l)-'xO))-'SO  -  l)-'x(r). 

To  summarize,  once  we  have  obtained  SOo)-1  and  b(r0)  for  some  base  period  r0  from  (4), 
updates  can  be  performed  directly,  without  matrix  inversion,  via  formulas  (10),  (11)  and  (12) 
together. 

In  this  era  of  rapid,  economical  computation,  avoidance  of  matrix  inversion  is  not  so 
important  as  it  once  was.  Furthermore,  single-equation  econometric  models  usually  don’t 
involve  large  numbers  of  explicit  coefficients  because  multicollinearity  tends  to  render  the  esti¬ 
mation  process  unstable  and  meaningless;  and,  of  course,  multicollinearity  presents  the  same 
problem  in  DLS  as  it  does  in  OLS.  Nevertheless,  the  simple,  direct  updating  process  embodied 
in  (10),  (11),  and  (12)  is  appealing  and  very  easy  to  implement,  especially  in  a  matrix-oriented 
computer  language  like  APL.  The  updating  process  is  also  very  economical  in  terms  of  data 
storage  requirements.  Furthermore,  equation  (10)  provides  a  conceptual  bridge  to  the  well- 
known  steady-state  smoothing  formulas  of  R.G.  Brown  [1]. 

4.  RELATIONSHIP  TO  BROWN’S  STEADY-STATE  FORMULAS 

Brown  (1]  worked  out  steady-state  results  for  the  special  case  where  x(r)  —  A*x(0)  for 
some  fixed  n  x  n  transition  matrix  A  and  suitable  starting  vector  x(0);  see  also  Montgomery 
and  Johnson  [3]  for  a  complete  derivation.  This  special  case  of  DLS  includes  all  independent 
variables  which  are  regular  functions  of  time  (e.g.,  polynomial  and  tri|"<nometric  functions  of 
time),  and  it  allows  reduction  of  the  direct  smoothing  equations  to  a  very  neat  steady-state 
form.  On  the  other  hand,  it  is  clear  that  general  econometric  modeling  is  not  encompassed  by 
this  simplified  framework. 

Brown’s  special  case  can,  of  course,  be  treated  within  our  general  setup  of  Section  3,  but 
there  is  no  apparent  simplification  of  the  results  or  saving  of  update  effort.  Brown,  however, 
obtained  simplified  results  via  an  "origin  shift*  of  the  coefficients 

(13)  i(r)  -  (AO %(r)  -  (A0%(r). 
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Then,  since  (A')  (A  O',  we  get 

(14)  yk(t  +  k)  -  x(/  +  kmt)  -  x(r  +  ky(KTiU) 

-  (A-'xO  +  k))i(t)  -  x(*)“b(f) 

so  that  forecasts  depend  only  on  the  transformed  coefficient  estimate  vector  b(r)  and  the 
"shifted"  explanatory  vector  x(k). 

The  direct  smoothing  equation  (10)  becomes 

(15)  (A')-'b(r)  -  (A')-('-‘)b(f  -  1)  +  i (t)et(t). 

b (<)  -  A i(t  -  1)  +  i(t)e,(t) 
where  i(t)  -  (A')0t(/)  -  (AO'z(f).  Now, 

i(t)  -  (A')S(r)-1x(/)  -  (A')S(f)-,A'x«» 

-  SO)-'x(O) 

SO)  -  A“S(f)  (AO"'  -  A"SO)  (A-0' 

-  A-'j^8'^(A0t(0))(A>x(0))’ 

-  8'-J( A-^xXO))  (A-('-^x(0))' 

-  8^(A">x(0))  (A-'x(O))’. 

For  a  well-behaved  transition  matrix  A  and  8  €  (0,  1),  the  limit  S(«>)  —  lim,_«,  S(r)  exists  and 
is  generally  approached  rapidly.  Hence,  equation  (15)  rapidly  takes  the  steady-state  form 

(18)  bO)  -  A'b(r  -  1)  +  i(oo)e,0) 

where  z(<»)  -  S(*»)_1x(0).  This  is  a  very  neat  result  in  that  the  matrix  S(<»)  is  easily  com¬ 
puted  for  a  given  discount  factor  and  need  only  be  inverted  once  to  get  the  steady-state  multi¬ 
plier  x(oo);  then,  given  a  starting  vector  b(0),  steady-state  updates  can  occur  routinely  via  equa¬ 
tion  (18).  In  practice,  an  initial,  untransformed  DLS  estimate  bO0)  is  obtained  via  equation 
(4)  and  bO0)  -  (A')'°  b(/0)  is  used  as  b(0)  to  start  the  steady-state  update  process  (18);  other¬ 
wise,  the  starting  vector  b(0)  is  chosen  judgmentaliy. 

Besides  its  mathematical  simplicity,  the  steady-state  update  process  (18)  has  the  additional 
desirable  feature  of  very  limited  data  storage  requirements.  Once  computed,  *(<»)  is  a  fixed 
parameter  vector  and  A  is  a  fixed  parameter  matrix;  besides  these  fixed  parameters,  one  need 
only  maintain  the  most  recently  updated  coefficient  estimate  vector  b(f).  It  is  worth  noting  that 
the  generalized  recursive  procedure  of  Section  3  is  just  as  parsimonious  in  terms  of  data  storage 
requirements,  although  more  routine  updating  is  necessary  (e.g.,  SO)-1  and  s (/),  as  well  as 
b(r)).  The  following  two  examples  illustrate  both  Brown’s  steady-state  updating  process  and 
the  more  general  recursive  updating  process  of  Section  3. 


(16) 

(17) 
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EXAMPLE:  SIMPLE  EXPONENTIAL  SMOOTHING 

In  this  example,  we  assume  E(yU))  —  /3  for  all  t,  where  /3  is  an  unknown  scalar  constant. 
Hence,  n  -  1  and  we  have  the  simple  scalar  relationships  xO)  -  A  -  1,  60)  —  Mr), 
SO)  -  S(r),  and  zO)  -  zO)  for  all  t  Assuming  8  €  (0,  1)  and  a  -  1  -  8,  we  have 

SO )  -  8'~J  -  JJ#'-  (1  -  80a-1  —  a"1 

z(f)-SO)-1  — a. 

Hence,  the  transient  update  equation  (CIO)  or  (15))  takes  the  form 
Mr)  -  60  -  1)  +  (1  -  SO-'aejO) 
while  the  steady-state  update  equation  (18)  takes  the  familiar  form 
bO)  -  bO  -  1)  +  aet(t). 

In  this  simple  situation  y*0  +  k)  •“  60)  for  all  k  ^  1,  so  our  forecasts  for  all  future  periods  at 
time  t  are  identical. 

EXAMPLE:  EXPONENTIAL  SMOOTHING  WITH  TIME  TREND 


In  this  example,  we  assume 

£(yO))-0i+0jf-*O)’0 

for  all  t,  where  n  -  2,  0  -  (£,,  p2Y  is  an  unknown  vector  consisting  of  intercept  and  trend 
(or  slope)  0j,  and  xO)  -  (1,  tY.  In  Brown’s  setup,  xO)  -  AMO)  where  x(0)  -  (1,  0)'  and 

I  1  ol 


It  follows  that 


A J- 


1  0 
j  1 


and  AMO) 


(1,  jY  for  any  real  integer  j.  Assuming  8  €  (0,  1)  and  a 
,-i  [  l  -y|  I  M'l  -/i«| 

'%p \ -jA~ l -aw  /.<*>! 


S(r) 


1  -  8,  we  get 


where 

/O0)  -  T8'-  (1  -80a-1 -a"1 

/, 0)  -  (8/0O)  -  '*')«■'  —  *«-2 

m  -  <o + »>/t«  - t(t  - — *(1 + 
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Then, 

kit)  -  S(*)-‘x(0) 

-  <Joit)f2it)  -  ft  it )*)-lC/j(r).  Ait))'  -  («(1  +  8),  «2)'. 

Hence,  the  transient  update  equation  (IS)  takes  the  form 
bxit)  -  biit  -  1)  +  b2it  -  1)  +  i,(r)e,(f) 
b2it)  -»  i>2it  -  1)  +  i2(r)e)(f) 

where 

i,(f)  -  if0it)f2it)  -  Ait)1)'1  Ait) 
i2it)  -  if0it)f2it)  -  Ait)2)'lAit) 
and  the  steady-state  update  equation  (18)  takes  the  familiar  form 
8i(f)  -  biit  -  1)  +  b2it  -  1)  +  ail  +  «)*,(/) 
b2it)  »  b2it  —  1)  +  a2ei  it). 

Since  tit)  —  (A_0'z(f)  -  (i((/)  -  z2it)t,  z2it))',  we  have  r,(r)  =  ii(«>)  -  z2(<»)f- 
«(1  +  8)  -  a2t  and  z2it)  =  a2  for  /  sufficiently  large.  Hence,  the  general  update  equation  (10) 
takes  the  form 

biit)  —  btit  -  1)  +  iiiit)  ~  z2it)t)etit) 
b2it)  -  b2it  -  1)  +  i2il)etit) 

which  indicates  that  the  intercept  update  process  would  become  increasingly  volatile  over  time. 
This  volatility,  however,  is  immaterial  to  the  forecasting  process.  Since  b(r)  -  (AO'b(f)  - 
(*i (f)  +  b2it)t,  b2it))\  the  forecast  for  the  *-th  future  period  at  time  / is  identically 

ykit  +  k)  —  8j(/)  +  b2it)  it  +  k) 

-  (*,(/)  +  k2it)t)  +  b2it)k 

-  biit)  +  b2it)k. 

Brown's  steady-state  approach  is  very  attractive  for  DLS  models  which  depend  on  regular 
functions  of  time.  We  think  the  smoothing  formulas  of  Section  3  provide  a  straightforward 
extension  of  Brown’s  approach  to  general  econometric  models  with  arbitrary  explanatory  time 
series.  The  second  example  above  indicates  that  the  general  update  process  (10)  is  likely  to  be 
less  volatile  over  the  long  run  if  systematic  growth  components  are  removed  from  the  various 
series  beforehand  by  first-differencing  or  some  other  means.  We  have  not  found  this  to  be  an 
important  point  in  practical  econometric  work,  however.  The  example  in  the  next  section  illus¬ 
trates  a  simple  model  where  business  cycle  factors  predominate  and  drive  the  coefficient  (fore¬ 
cast)  update  process. 

S.  A  SIMPLE  ECONOMETRIC  EXAMPLE 

To  illustrate  the  efficacy  of  DLS  in  an  econometric  context,  we  put  yit)  equal  to  total 
U.S.  new  passenger  car  unit  sales  in  quarter  rand  xit)  -  (jr|(r),  x2(/))',  where  X|(/)  *  1  and 
x2it)  is  the  motor  vehicles  and  parts  personal  consumption  component  of  real  Grots  National 
Product  (1972$)  in  quarter  t  Both  yit)  and  x2(t)  are  seasonally  trusted,  annualized  rates 
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from  the  Bureau  of  Economic  Analysis,  U.S.  Department  of  Commerce.  The  overall  time 
interval  ranges  from  the  first  quarter  of  1970  to  the  last  quarter  of  1980.  We  use  the  12  quarter 
interval  from  1970:1  to  1972:4  to  initialize  the  model.  We  use  the  32  quarter  interval  from 
1973:1  to  1980:4  to  monitor  the  update  process  and  to  measure  ex  post  forecast  errors  for  1,  2, 
3,  and  4  quarters  ahead.  The  simulated  ex  post  forecasts  incorporate  actual  historical  observa¬ 
tions  for  the  consumption  series  x2(/);  in  reality,  future  values  would  have  to  be  estimated  (ex 
ante)  from  some  type  of  quarterly  GNP  forecasting  model  with  the  requisite  level  of  disaggre¬ 
gation. 

Now  clearly  the  time  series  y(t)  and  x2(t)  should  be  highly  correlated  since  most  new 
passenger  car  sales  (aside  from  business  and  government  purchases)  are  imbedded  in  the 
broader  personal  consumption  series,  which  also  includes  a  net  used  car  component,  small 
trucks,  other  recreational  vehicles,  service  parts,  and  so  forth.  On  the  other  hand,  the  relation¬ 
ship  between  the  two  series  is  far  from  exact  and  may  change  over  time,  cyclically  and  perhaps 
secularly.  Moreover,  the  new  car  sales  series  is  widely  followed  as  a  cyclical  indicator,  and  a 
simple  mechanism  to  generate  forecasts  from  a  GNP  component  is  very  desirable. 

Table  1  summarizes  the  average  signed,  absolute,  and  absolute  percentage  forecast  errors 
1,  2,  and  4  quarters  ahead  (as  defined  in  Section  2)  for  discount  factors  ranging  from  .1  to  1.0; 
the  omitted  3  quarter  ahead  results  are  analogous.  Our  focus  on  simple  arithmetic  error  meas¬ 
ures  is  intuitive.  Despite  the  least-squares  basis  of  DLS,  we  think  most  people  conceptualize 
forecast  errors  in  these  simple  arithmetic  terms. 

The  average  errors  for  a  first-order  autoregressive  (AR1)  model  are  also  included  in  Table 
1.  This  GLS  model  (see  Johnston  (2],  Chapter  9,  and  Theil  [6],  Section  6.3)  is  commonly 
used  in  econometric  work  when  the  OLS  Durbin-Watson  statistic  turns  out  exceptionally  small, 
as  it  did  in  this  case  over  most  of  the  simulated  ranges.  The  AR1  solution  procedure  is  also 
commonly  known  as  the  Cochrane-Orcutt  technique. 

Table  2  contains  detailed  quarter-by-quarter  coefficient  adjustments  and  forecasts  for  the 
case  6  -  .5.  Table  3  contains  the  full  range  of  input  data  and  intermediate  results  for  that  same 
case  fi  -  .5.  This  example  is  computationally  very  simple,  so  we  just  iterated  the  indirect  equa¬ 
tions  (8),  (9),  and  (4)  to  get  updated  coefficient  estimates  b(r)  -  (b\(t),  b2(t))'  for  each  quar¬ 
ter  t  Table  2  and  3  entries  have  been  suitably  rounded  for  presentation  purposes. 

It  is  clear  from  the  simulation  results  that  OLS  is  significantly  inferior  to  both  AR1  and 
alternative  DLS  models  in  terms  of  average  absolute  forecast  error,  and  furthermore  that  ARI 
is  inferior  to  the  best  DLS  models  with  discount  factors  in  the  .5  neighborhood.  Since  the  DLS 
solution  procedure  is  also  much  simpler  than  ARI,  the  approach  certainly  seems  worthy  of  con¬ 
sideration  in  many  short-term  forecasting  situations  where  ARI  is  routinely  applied. 

It  is  interesting  to  note  in  Table  1  that  DLS  forecast  bias  generally  increases  with  the  fore¬ 
cast  horizon  for  a  fixed  discount  factor  and  with  the  discount  factor  for  a  fixed  forecast  horizon. 
On  the  other  hand,  the  average  absolute  forecast  error,  in  this  particular  example,  is  approxi¬ 
mately  minimized  across  all  the  forecast  horizons  with  discount  factor  in  the  .5  neighborhood. 
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TABLE  1  —  Summary  Forecast  Error  Analysis 


s 

Average 

1973:1-1980:4 

e\  ax  pt 

Average 

1973:2-1980:4 

02  Pi 

Average 

1973:4-1980:4 

e«  a,  pt 

.1 

-.09 

.56 

6.7 

.06 

.82 

22.4 

.86 

9.3 

.2 

-.05 

.39 

4.6 

-.05 

.48 

5.5 

-.33 

.57 

5.7 

.3 

-.04 

.31 

3.5 

-.08 

.35 

3.8 

-.34 

.51 

5.1 

A 

-.05 

.27 

2.8 

-.11 

.30 

3.1 

-.36 

.49 

4.9 

.5 

-.07 

.25 

2.6 

-.15 

.30 

3.0 

-.40 

.49 

4.8 

.6 

-.11 

.26 

2.6 

-.20 

.31 

3.1 

-.45 

.50 

4.9 

.7 

-.17 

.29 

3.0 

-.27 

.36 

3.6 

-.54 

.58 

5.7 

.8 

-.29 

.37 

3.7 

-.41 

.45 

4.5 

-.67 

.68 

6.7 

.9 

-.51 

.53 

5.3 

-.62 

.63 

6.2 

-.84 

.84 

8.1 

l.O(OLS) 

-.84 

.84 

8.1 

-.92 

.92 

8.7 

-1.09 

1.09 

10.2 

AR1 

-.19 

.35 

3.5 

-.32 

.50 

5.0 

-.64 

.74 

7.0 

TABLE  2  —  Detailed  Results  fork  —  .5 


Quarter 

t 

6,(r) 

b,U) 

xikK 

RSI 

■51 

■51 

■51 

■51 

1972:4 

■HI 

H¥l 

iff 

NA 

NA 

NA 

1973:1 

MM 

0.164 

■ 

NA 

NA 

NA 

1973:2 

14 

58.0 

eh 

• 

11.8 

NA 

NA 

1973.3 

15 

-0.58 

0.213 

55.6 

ii. i 

11.6 

11.5 

11.4 

NA 

1973.4 

16 

-5.92 

51.7 

9.8 

10.5 

10.9 

■f’fl 

1974:1 

17 

-2.47 

0.244 

47.9 

9.3 

8.7 

9.6 

K31 

1974:2 

18 

-3.45 

0.262 

47.9 

9.0 

mm 

8.7 

ffltxl 

10.3 

1974:3 

19 

-3.32 

0.257 

49.9 

111 

9.7 

9.4 

10.1 

1974:4 

20 

-4.18 

0.274 

42.3 

mm 

7.6 

7.8 

PI 

1975:1 

21 

-2.86 

0.249 

43.7 

7.9 

8.0 

8.2 

1975:2 

22 

-3.76 

0.264 

44.2 

8.2 

7.9 

8.0 

8.1 

1975:3 

23 

-0.68 

0.194 

49.8 

8.9 

MX* 

9.5 

9.5 

9.5 

1975:4 

24 

-0.01 

0.179 

52.2 

9.3 

mm 

10.0 

10.1 

10.1 

1976:1 

25 

0.77 

0.163 

56.9 

10.0 

10.3 

11.3 

11.3 

1976.2 

26 

1.62 

0.145 

57.0 

9.8 

■ 

10.2 

10.4 

11.3 

1976:3 

27 

1.79 

0.142 

57.4 

9.9 

Ef 

10.1 

10.3 

10.4 

1976:4 

28 

0.83 

0.160 

58.0 

10.2 

10.0 

10.0 

10.2 

10.4 

1977:1 

29 

0.10 

0.173 

63.5 

11.1 

11.0 

10.8 

10.8 

11.1 

1977:2 

30 

-0.75 

0.188 

63.1 

11.2 

11.0 

10.9 

10.7 

10.8 

1977:3 

31 

0.168 

63.2 

10.9 

11.2 

11.0 

11.0 

10.7 

1977:4 

32 

■El 

0.146 

64.1 

11.0 

11.2 

11.3 

11.2 

11.1 

1978:1 

33 

0.72 

0.161 

62.4 

10.7 

10.8 

10.9 

11.0 

10.9 

1978:2 

34 

0.19 

0.169 

68.0 

11.7 

11.7 

11.6 

11.8 

12.1 

1978:3 

35 

0.168 

65.6 

11.2 

11.3 

11.3 

11.3 

11.4 

1978:4 

36 

0.119 

66.8 

11.1 

11.4 

11.5 

11.5 

11.5 

1979:1 

37 

2.66 

0.130 

66.4 

11.4 

11.2 

11.4 

11.4 

11.4 

1979:2 

38 

1.85 

0.142 

59.4 

10.3 

10.4 

10.4 

10.2 

10.2 

1979:3 

39 

2.82 

0.128 

60.8 

10.7 

10.5 

10.6 

10.5 

10.4 

1979:4 

-1.71 

0.196 

EH 

9.8 

10.6 

10.4 

10.5 

10.5 

1980:1 

41 

-2.48 

0.210 

62.1 

10.6 

10.5 

10.8 

10.7 

10.7 

1980:2 

42 

-1.98 

0.202 

47.0 

7.5 

K1 

mm 

HI 

s.s 

1980:3 

43 

-1.28 

0.192 

51.5 

8.8 

Rfl 

■tv 

El 

9.4 

1980:4 

44 

-1.17 

0.189 

54.6 

9.1 

KH 

9.0 
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TABLE  3  —  Input  Data  and  Intermediate  Results  for  8  —  .5 


Quarter 

t 

*?(/) 

y(t) 

Suit) 

sl2U) 

Sn(t) 

v,(r) 

v5(f) 

1970:1 

1 

39.0 

8.8 

1.00 

39.0 

1,521 

8.8 

343 

1970:2 

2 

40.3 

9.1 

1.50 

59.8 

2,385 

13.5 

538 

1970:3 

3 

40.2 

9.0 

1.75 

70.1 

2,808 

15.8 

631 

1970:4 

4 

33.1 

6.9 

1.88 

68.1 

2,500 

14.8 

544 

1971:1 

5 

42.6 

10.0 

1.94 

76.7 

3,065 

17.4 

698 

1971:2 

6 

44.1 

9.9 

1.97 

82.4 

3,477 

18.6 

786 

1971:3 

7 

46.1 

10.4 

1.98 

87.3 

3,864 

19.7 

872 

1971:4 

8 

49.3 

10.6 

1.99 

93.0 

4,362 

20.4 

959 

1972:1 

9 

49.7 

10.5 

2.00 

96.2 

4,651 

20.7 

1,001 

1972:2 

10 

51.2 

10.7 

2.00 

99.3 

4,947 

21.1 

1,048 

1972:3 

11 

52.2 

10.8 

2.00 

101.8 

5,198 

21.3 

1,088 

1972:4 

12 

56.5 

11.6 

2.00 

107.4 

5,791 

22.3 

1,199 

1973:1 

13 

60.7 

12.3 

2.00 

114.4 

6,580 

23.4 

1,346 

1973:2 

14 

58.0 

12.1 

2.00 

115.2 

6,654 

23.8 

1,375 

1973:3 

15 

55.6 

11.1 

2.00 

113.2 

6,418 

23.0 

1,305 

1973:4 

16 

51.7 

9.8 

2.00 

108.3 

5,882 

21.3 

1,159 

1974:1 

17 

47.9 

9.3 

2.00 

102.1 

5,235 

20.0 

1,025 

1974:2 

18 

47.9 

9.0 

2.00 

98.9 

4,912 

19.0 

944 

1974:3 

19 

49.9 

9.4 

2.00 

99.4 

4,946 

18.9 

941 

1974:4 

20 

42.3 

7.4 

2.00 

92.0 

4,262 

16.8 

783 

1975:1 

21 

43.7 

8.2 

2.00 

89.7 

4,041 

16.6 

750 

1975:2 

22 

44.2 

7.7 

2.00 

89.0 

3,974 

16.0 

715 

1975:3 

23 

49.8 

8.9 

2.00 

94.3 

4,467 

16.9 

801 

1975:4 

24 

52.2 

9.3 

2.00 

99.4 

4,958 

17.8 

886 

1976:1 

25 

56.9 

10.0 

2.00 

106.6 

5,717 

18.9 

1,012 

1976:2 

26 

57.0 

9.8 

2.00 

110.3 

6,107 

19.2 

1,065 

1976:3 

27 

57.4 

9.9 

2.00 

112.5 

6,348 

19.5 

1,101 

1976:4 

28 

58.0 

10.2 

2.00 

114.3 

6,538 

20.0 

1,142 

1977:1 

29 

63.5 

11.1 

2.00 

120.6 

7,301 

21.1 

1,276 

1977:2 

30 

63.1 

11.2 

2.00 

123.4 

7,632 

21.7 

1,345 

1977:3 

31 

63.2 

10.9 

2.00 

124.9 

7,810 

21.8 

1,361 

1977:4 

32 

64.1 

11.0 

2.00 

126.6 

8,014 

21.9 

1,386 

1978:1 

33 

62.4 

10.7 

2.00 

125.7 

7,901 

21.6 

1,361 

1978:2 

34 

68.0 

11.7 

2.00 

130.8 

8,574 

22.5 

1,476 

1978:3 

35 

65.6 

11.2 

2.00 

131.0 

8,591 

22.5 

1,473 

1978:4 

36 

66.8 

11.1 

2.00 

132.3 

8,758 

22.3 

1,478 

1979:1 

37 

66.4 

11.4 

2.00 

132.6 

8,788 

22.6 

1,496 

1979:2 

38 

59.4 

10.3 

2.00 

125.7 

7,922 

21.6 

1,360 

1979:3 

39 

60.8 

10.7 

2.00 

123.6 

7,658 

21.5 

1,330 

1979:4 

40 

60.3 

9.8 

2.00 

122.1 

7,465 

20.5 

1,256 

1980:1 

41 

62.1 

10.6 

2.00 

123.2 

7,589 

20.9 

1,286 

1980:2 

42 

47.0 

7.5 

2.00 

108.6 

6,003 

17.9 

996 

1980:3 

43 

51.5 

8.8 

2.00 

105.8 

5,654 

17.8 

951 

1980:4 

44 

54.6 

9.1 

2.00 

107.5 

5,808 

18.0 
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Moreover,  there  is  little  difference  in  performance  for  discount  factors  in  the  fairly  broad  range 
.3  to  .7,  indicating  robustness  with  respect  to  choice  of  this  judgmental  parameter.  For  very 
small  discount  factors,  the  coefficients  evidently  adjust  rapidly  to  eliminate  bias  but  the  result¬ 
ing  procedure  is  excessively  volatile.  For  very  large  discount  factors  (e.g.,  OLS),  the 
coefficients  adjust  slowly  and  the  resulting  procedure,  although  smoother,  is  quite  biased.  For 
mid-range  discount  factors  (around  .5  in  this  example),  the  resulting  procedure  adjusts  rapidly 
enough  to  keep  bias  small  while  also  providing  sufficient  stability  to  yield  relatively  small  aver¬ 
age  absolute  errors;  i.e.,  it  is  a  good  blend  of  dynamism  and  stability. 

Table  2  shows  how  the  coefficients  and  forecasts  fluctuated  over  the  simulation  interval 
1973:1  to  1980:4  for  the  quasi-optimum  case  8  -  .5.  Note  that  the  slope  estimate  b2(t)  is  far 
more  stable  than  the  intercept  estimate  b,(t).  Note  also  the  characteristic  pattern  of  negative 
intercepts  during  the  1973-7S  recessionary  contraction,  a  pattern  commencing  once  again  in  late 
1979.  The  forecasts  are,  for  the  most  part,  pretty  close  to  the  actual  new  car  sales. 

This  crude  forecasting  model  can  probably  be  improved  econometrically.  For  example, 
we  have  ignored  the  possible  incremental  impact  of  strikes  and  interest  rate  fluctuations  on  new 
car  sales.  We  have  also  made  no  attempt  to  differentiate  domestics  from  imports.  Further¬ 
more,  we  are  well  aware  that,  as  a  practical  matter,  bias  is  often  reduced  in  OLS  models  by 
adjusting  the  forecasts  with  judgmental  "add  factors”.  The  purpose  of  this  example  is  merely  to 
indicate  the  potential  value  of  DLS. 

6.  CHOOSING  THE  DISCOUNT  FACTOR 

Whenever  DLS  is  employed  in  any  form,  choice  of  the  discount  factor  becomes  an  impor¬ 
tant  issue.  In  the  example  of  Section  5,  ex  post  simulation  showed  8  —  .5  to  be  a  good  choice. 
This  is  not  always  the  case,  however.  We’ve  analyzed  other  cases  where  greater  inherent  vola¬ 
tility  in  the  dependent  series  caused  a  larger  discount  factor  to  be  approximately  optimum.  In 
any  event,  this  kind  of  ex  post  simulation  analysis  is  obviously  a  good  idea  when  plenty  of  his¬ 
torical  data  exists.  Otherwise,  one  must  resort  to  judgmental  criteria. 

From  a  judgmental  standpoint,  the  author  would  tend  to  focus  on  discount  factors 
between  .3  and  1.0  (OLS)  to  maintain  some  degree  of  stability  in  the  coefficient  update  process. 
If  OLS  on  the  historical  data  yields  a  very  low  Durbin-Watson  statistic,  one  would  naturally 
choose  a  lower  discount  factor  in  this  range  to  enhance  the  dynamism  of  the  process.  In  this 
case,  DLS  provides  an  alternative  to  the  ubiquitous  AR1  procedure. 

Another  judgmental  criterion  is  the  'moving  average  equivalence"  principle  (see 
Montgomery  and  Johnson  f3j,  p.  52).  The  idea  is  to  equate  (very  approximately)  a  DLS  model 
with  an  equivalent  ”rolling”  OLS  model,  over  the  most  recent  m  historical  periods,  in  terms  of 

m-i 

the  ’average  age  of  the  data"  j/m  -  (m  -  l)/2.  Based  on  simple  exponential  smoothing, 

we  get  the  equivalence  relationship  8  ■  (m  -  1  )/(m  +  1)  or  m  -  (1  +  8)/(l  -8);  this 
equivalence  is  fairly  crude  when  extended  to  general  DLS  models,  but  it  is  nonetheless  appeal¬ 
ing.  For  example,  if  we  wanted  the  average  age  of  our  data  to  be  3  periods,  we  would  choose 
8  -  .5;  for  4  periods,  8  -  .6;  and  for  9  periods,  8  -  .8.  In  many  short-term  forecasting  prob¬ 
lems  with  quarterly  data,  an  average  age  of  2  to  8  quarters  seems  intuitively  appealing;  this 
corresponds  to  .33  <  8  <  .78. 
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Another  possibility  is  to  adjust  the  discount  factor  itself  over  time.  Trigg  and  Leach  [7] 
introduced  an  intuitive  technique  to  adjust  the  discount  factor  in  response  to  a  buildup  of  sys¬ 
tematic  forecast  errors  (see  also  Montgomery  and  Johnson  13],  pp.  175-179).  Specifically,  they 
make  8  a  decreasing  function  of  |ei(/)l/ai(r),  where  et(t)  and  di(t)  are  simple  exponentially 
smoothed  values  of  the  one-period-ahead  signed  and  absolute  forecast  errors  respectively;  this 
adjustment  decreases  8  to  increase  responsiveness  when  there  is  a  buildup  of  signed  forecast 
error  (or  bias)  in  relation  to  the  mean  absolute  forecast  error.  The  Trigg-Leach  adaptive  tech¬ 
nique  may  induce  instability  into  some  DLS  models,  and  we  haven’t  tested  it  in  a  general  con¬ 
text.  Nevertheless,  the  technique  is  widely  employed  as  an  enhancement  to  simple  exponential 
smoothing,  its  volatility  is  easily  constrained,  and  the  general  approach  seems  like  an  avenue 
worth  exploring  in  practical  forecasting  situations. 

As  we’ve  seen,  there  is  no  hard  and  fast  rule  for  choosing  a  discount  factor,  although  it 
appears  that  good  values  can  be  estimated  in  specific  situations  and  there  are  even  possibilities 
for  adjusting  the  factor  adaptively.  The  fact  that  DLS  incorporates  a  judgmental  factor  should 
not  be  considered  a  negative.  The  discount  factor  merely  adds  practical  flexibility. 
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ABSTRACT 

This  paper  analyzes  (he  problem  of  determining  desirable  spares  inveniory 
levels  for  repairable  items  with  dependent  repair  times.  The  problem  is  impor¬ 
tant  for  repairable  products  such  as  aircraft  engines  which  can  have  very  large 
investment  in  spares  inventory  levels.  While  existing  models  can  be  used  to 
determine  optimal  inventory  spares  levels  when  repair  times  are  independent, 
the  practical  considerations  of  limited  repair  shop  capacity  and  prioritized  shop 
dispatching  rules  comoine  to  make  repair  limes  not  independent  of  one  anoth¬ 
er.  In  this  research  a  simulation  model  of  a  limited  capacity  repair  facility  with 
prioritized  scheduling  is  used  to  explore  a  variety  of  heuristic  approaches  to  the 
spares  stocking  decision.  The  heuristics  are  also  compared  with  use  of  a  model 
requiring  independent  repair  times  (even  though  that  assumption  is  not  valid 
here).  The  results  show  that  even  when  repair  time  dependencies  are  present, 
the  performance  of  a  model  which  assumes  independent  repair  times  is  quite 
good. 


i.  INTRODUCTION 

This  paper  focuses  on  the  problem  of  determining  a  spares  stocking  policy  for  a  repairable 
item  inventory  system  with  dependent  repair  times.  If  repair  times  are  assumed  to  be  indepen¬ 
dent,  there  are  several  procedures  available  for  determining  an  optimal  spares  stocking  policy 
(see  12].  (5-7],  [13]).  However,  when  repair  times  are  dependent,  as  they  are  when  there  is 
limited  or  shared  repair  capacity  and  items  are  subject  to  priority  scheduling,  existing  theory  is 
no  longer  strictly  applicable. 

Repairable  inventory  systems  are  systems  in  which  failed  units  are  repaired  and  returned 
to  service  (rather  than  scrapped  and  replaced  by  new  items).  One  important  example  of  such  a 
system  is  the  rework  and  repair  process  for  jet  engines  for  commercial  and  military  aircraft.  A 
modern  jet  engine  has  a  hierarchical  (indentured)  product  structure;  it  is  composed  of  several 
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subassemblies  called  modules,  which  in  turn  are  composed  or  a  number  of  parts  called  com¬ 
ponents.  An  engine  "failure"  or  need  for  maintenance  is  caused  by  the  malfunction  of  one  or 
more  modules,  which  are  in  turn  caused  by  one  or  more  component  failures.  This  hierarchical 
product  structure  has  three  indenture  levels— the  engine,  the  modules  and  the  components. 

Consider  a  commercial  airline  with  a  large  number  of  planes  in  its  fleet.  At  random 
times,  an  engine  malfunction  is  discovered  and  the  engine  is  removed  from  the  wing  for  repair. 
If  an  identical  spare  engine  is  available  it  is  used  to  replace  the  failure  and  the  plane  is  returned 
to  service.  If  no  spares  are  available,  the  plane  is  grounded  until  a  repaired  engine  of  the  same 
type  is  completed  by  the  repair  facility. 

The  engine  which  is  removed  is  sent  to  the  repair  facility  where  one  or  more  modules 
requiring  rework  are  identified  (see  Figure  1).  These  modules  are  removed  from  the  engine 
and  are  replaced  by  identical  spares  if  they  are  available.  If  not,  the  engine  must  wait  in  its 
repair  stall  until  the  proper  modules  emerge  from  the  repair  process.  A  similar  procedure  is 
followed  for  the  components  which  require  rework.  The  component  parts  are  repaired 
(reworked)  in  the  repair  shop.  The  repair  work  content  of  modules  and  engines  is  primarily 
inspection,  disassembly  and  assembly,  and  testing,  with  little  c  10  machine  center  processing 
required. 

The  demand  for  spare  components  is  dependent  on  the  repair  requirements  for  modules; 
module  demands  are  directly  dependent  on  engine  requirements.  Performance  of  this  system  is 
a  function  of  three  major  factors: 

1.  The  target  spares  inventory  levels  for  engines,  modules  and  components.  The  func¬ 
tion  of  spares  inventory  is  to  provide  system  support  while  the  faulty  items  are 
repaired. 

2.  The  capacity  to  repair  components  and  to  perform  inspection,  assembly,  and  test¬ 
ing  on  modules  and  engines.  Larger  shop  capacity  results  in  shorter  average  shop 
repair  times  (called  cycle  times),  but  of  course,  a  lower  percentage  utilization  of 
capacity. 

3.  The  priority  scheduling  system  used  in  the  repair  shop.  A  responsive  scheduling 
system  should  reduce  repair  times  for  those  components  which  are  currently  in 
short  supply  while  allowing  less  critical  components  to  be  repaired  more  slowly. 
Priorities  will  change  over  time  as  different  parts  become  critical. 

A  previous  study  [4]  examined  the  effects  of  alternative  scheduling  policies  on  the  sys¬ 
tem.  In  this  study,  the  repair  shop  capacity  and  the  priority  scheduling  policy  are  held  constant 
and  various  procedures  for  determining  improved  spares  stocking  policies  are  investigated. 

2.  RELEVANT  LITERATURE 

One  of  the  most  important  problems  in  a  repairable  inventory  system  is  the  determination 
of  an  optimal  spares  stocking  policy,  given  a  total  spares  budget.  A  thorough  review  of  the 
research  work  on  this  problem  has  recently  been  provided  by  Nahmias  [8],  Much  of  this  litera¬ 
ture  is  concerned  not  only  with  optimal  spares  levels  in  a  hierarchical  product  structure,  but 
also  the  allocation  of  these  spares  to  various  geographically  separate  locations  (echelons)  in  a 
decentralized  logistical  support  system. 
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Figure  1.  Schematic  of  repairable  inventory  flow. 


The  usual  control  system  here  is  a  continuous  review,  one-for-one  (S  -  1,  S)  ordering 
policy.  Thus,  whenever  an  item  fails,  an  order  is  immediately  placed  for  a  replacement  part  of 
the  same  type,  with  an  order  quantity  of  one,  and  the  failed  item  is  sent  immediately  to  the 
repair  shop  (i.e.,  batching  items  for  repair  is  not  economical) 
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Sherbrooke’s  METRIC  model  [14]  provided  an  important  analytical  treatment  of  the 
optimal  spares  problem.  His  model  determines  an  optimal  spares  policy  for  a  system  consisting 
of  a  central  repair  depot  with  several  subordinate  bases.  His  objective  was  to  minimize 
expected  backorders  for  all  items  (components  as  well  as  engines).  Muckstadt  [S]  subsequently 
noted  that  only  backorders  for  engines  affect  the  availability  of  the  aircraft.  Muckstadt’s  formu¬ 
lation,  called  MOD-METRIC,  minimized  expected  backorders  for  the  end  item,  i.e.,  engines, 
subject  to  an  investment  constraint  on  total  dollars  allocated  to  both  the  engine  and  module 
spares. 

The  following  assumptions  are  made  in  Sherbrooke’s  and  Muckstadt’s  models: 

(1)  Demands  are  generated  by  a  stationary  Compound  Poisson  process. 

(2)  A  single  module  fails  in  each  engine. 

(3)  Repair  times  for  all  items  are  independent  (i.e.,  there  is  no  waiting  or  batching 
before  failed  items  begin  the  repair  process). 

The  assumption  of  independent  repair  times  enabled  Sherbrooke  and  Muckstadt  to  utilize 
a  queuing  theory  result  called  Palm’s  Theorem  [9],  which  states  that  if  failures  are  generated  by 
a  stationary  Poisson  process  and  repair  times  are  independent,  identically  distributed  random 
variables,  then  the  steady-state  number  of  items  undergoing  repair  at  any  given  time  is  also 
Poisson  with  a  mean  equal  to  the  product  of  the  item’s  failure  rate  and  its  mean  repair  time. 
These  models  and  related  research  require  assumption  (3)  of  independence  of  repair  times. 
For  a  shop  with  limited  or  shared  capacity,  repair  times  will  not  be  independent.  Therefore, 
these  models  are  not  directly  applicable  to  the  problem  considered  here. 

Another  set  of  models,  the  so-called  "machine  repair"  models,  can  represent  queuing  in 
the  repair  shop.  These  models  assume  that  there  are  m  operational  machines  (engines),  n 
identical  spares,  and  S  >  1  parallel  servers  or  repair  stations.  When  a  failure  occurs,  it  queues 
for  repair.  Utilizing  variations  of  this  type  of  model  and  assuming  repair  times  are  exponen¬ 
tially  distributed,  several  authors  (see  references  [1],  [2],  [3]  and  [11])  have  examined  the 
problem  of  how  many  spares  and  repair  channels  are  required  for  a  system  to  operate  at  a  given 
service  level.  However,  the  limitation  of  exponential  repair  times  is  overly  restrictive.  More¬ 
over,  these  models  cannot  represent  dynamic  priority  scheduling  rules  such  as  those  found 
most  effective  in  a  previous  study  [4], 

In  this  article,  a  simulation  model  of  a  hypothetical  repair  shop  will  be  used  to  analyze  the 
spares  stocking  problem  when  repair  times  are  dependent.  First,  several  heuristic  indices  are 
used  to  find  spares  levels  which  approximate  the  "efficient  frontier"  trading  ofT  inventory  invest¬ 
ment  versus  engine  backorders.  Next,  the  spares  budget  is  fixed  and  two  alternative  methods 
of  determining  spares  stocking  levels  are  investigated.  Finally,  the  sensitivity  of  system  perfor¬ 
mance  to  the  distribution  of  spares  among  the  three  indenture  levels  is  explored. 

3.  THE  MODEL 

The  simulation  model  used  is  identical  to  that  presented  in  Hausman  and  Scudder  [4]  to 
examine  different  priority  scheduling  rules  and  will  not  be  repeated  in  detail  here.  However,  it 
is  useful  here  to  present  those  elements  which  are  of  importance  to  this  paper. 
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The  model  contains  ten  work  centers,  each  with  limited  capacity.  The  model  simulates 
actual  queuing  in  work  centers,  thereby  representing  the  dependent  repair  times  described  ini¬ 
tially.  Table  1  presents  the  conditional  failure  rates  for  each  of  the  modules  and  components  in 
the  model.  It  is  assumed  that  a  single  module  fails  in  each  engine  and  a  single  component  fails 
in  each  module. 


TABLE  1  —  Failure  Probabilities  (Single-Failure  Model) 


Module 

Module 

Failure  Probability 

Component 

Conditional  Component 
Failure  Probability 

1 

0.1667 

n 

0.2903 

12 

0.1936 

13 

0.2903 

14 

0.2258 

1.0000 

2 

0.0833 

21 

0.3333 

22 

0.6000 

23 

0.0667 

1.0000 

3 

0.2500 

31 

0.1026 

32 

0.0256 

33 

0.1538 

34 

0.2308 

35 

0.2051 

36 

0.2308 

37 

0.0513 

1.0000 

4 

0.2778 

41 

0.0800 

42 

0.3200 

43 

0.2000 

44 

0.4000 

1.0000 

5 

0.2222 

51 

0.3200 

52 

0.1600 

53 

0.2400 

54 

0.0800 

55 

0.2000 

1.0000 

1.0000 

The  performance  criterion  used  is  expected  delay  days  for  engines  (i.e.,  time-weighted 
engine  backorders).  Mathematically  this  quantity  is  calculated  by  dividing  expected  engine 
backorders  by  the  constant  daily  mean  engine  failure  rate.  Thus,  an  equivalent  performance 
measure  is  expected  engine  backorders.  This  criterion  is  used  as  it  is  representative  of  the  most 
critical  aspect  of  system  performance— the  grounding  of  a  plan  for  lack  of  an  engine.  It  is  also 
the  criterion  found  in  the  majority  of  the  literature. 
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The  initial  spares  vector  used  in  the  model  is  obtained  using  a  combination  of  enumera¬ 
tion  and  an  approximate  technique  proposed  by  Muckstadt  [6).  Engine  and  module  spares  lev¬ 
els  are  determined  using  complete  enumeration  of  all  possible  combinations  for  a  given  total 
budget.  (Although  this  may  appear  to  involve  a  large  number  of  iterations,  the  relevant  alloca¬ 
tions  are  easily  determined  due  to  the  quasi-convexity  of  the  expected  backorder  function  as 
shown  by  Muckstadt  [6].)  For  any  given  combination,  the  remaining  budget  is  then  allocated 
to  component  spares  using  Muckstadt’s  approximation  technique  (6). 

Muckstadt’s  method  approximates  the  expected  backorder  function  by  an  exponential 
function  at  the  lowest  indenture  level  (i.e.,  components  in  our  model).  The  objective  is  to 
minimize  the  sum  of  the  expected  backorders  for  all  the  components  subject  to  a  budget  con¬ 
straint  (i.e.,  the  remaining  budget).  Specifically,  let: 

11,(5,)  -  expected  backorders  for  item  / 
given  an  intial  stocking  level  5,. 

Then 

(1)  fl,(5,)-  Z  ix  -  S,) p(x\k,R,) 


S,  —  stocking  level  for  item  i 
A,  —  failure  rate  for  item  i 
R,  -  mean  repair  time  for  item  i  _ 

p(x\\iRi )  —  Poisson  probability  with  mean  A,/?,. 

The  problem  then  is: 

min  £  Bj(Sj) 

Mi 

subject  to  Y  C,S,  <  C 
«u  / 

where 


C,  -  unit  cost  of  item  i 
C  -  available  budget. 

The  approximation  made  is: 

(2)  5,(5,)  = 

where  a,  and  b ,  are  positive  constants  to  be  determined.  This  approximation  is  shown  to  have 
high  accuracy  when  5,  >  A,5„  that  is,  when  spares  for  item  i  equal  or  exceed  the  expected 
quantity  of  item  /  in  resupply  in  the  system.  This  condition,  analogous  to  a  reorder  point  equal¬ 
ing  expected  lead  time  demand  plus  positive  safety  stock,  will  typically  hold;  i.e.,  we  expect 
positive  safety  stock,  not  negative  safety  stock.  Values  for  a,  and  b,  are  determined  by  calculat¬ 
ing  the  actual  expected  backorders  for  various  values  of  Sh  A,,  and  R,  using  (1)  and  fitting  the 
logarithm  of  the  exponential  function  in  (2)  using  linear  regression.  Note  that  this  approxima¬ 
tion  is  indirectly  affected  by  the  mean  repair  times,  R,.  For  our  purposes,  the  mean  repair 
times  obtained  from  FCFS  priority  scheduling  are  used.  The  spares  vector  produced  by  this 
procedure  is  shown  in  Table  2,  together  with  the  costs  of  each  item. 


NAVAL  RESEARCH  LOGISTICS  QUARTERLY 


VOL.  29,  NO.  2,  JUNE  19*2 


SPARES  STOCKING  POLICIES  FOR  REPAIRABLE  ITEMS 


309 


TABLE  2  —  Initial  Spares  Vector 


Item 

Unit  Cost 

Number  of  Spares 

Total  Cost 

Engine 

$2,000,000 

0 

$  0 

Module  1 

400,000 

2 

800,000 

Module  2 

308,000 

1 

308,000 

Module  3 

630,000 

2 

1,260,000 

Module  4 

490,000 

2 

980,000 

Module  S 

575,000 

2 

1,150,000 

Component  11 

135,000 

2 

270,000 

Component  12 

85,000 

3 

255,000 

Component  13 

150,000 

2 

300,000 

Component  14 

110,000 

3 

330,000 

Component  21 

80,000 

2 

160,000 

Component  22 

100,000 

4 

400,000 

Component  23 

60,000 

0 

0 

Component  31 

160,000 

1 

160,000 

Component  32 

110,000 

1 

110,000 

Component  33 

115,000 

2 

230,000 

Component  34 

90,000 

2 

180,000 

Component  35 

120,000 

2 

240,000 

Component  36 

85,000 

3 

255,000 

Component  37 

100,000 

1 

100,000 

Component  41 

140,000 

1 

100,000 

Component  42 

100,000 

4 

400,000 

Component  43 

80,000 

4 

320,000 

Component  44 

150,000 

4 

600,000 

Component  51 

140,000 

2 

280,000 

Component  52 

90,000 

2 

180,000 

Component  53 

100,000 

3 

300,000 

Component  54 

130,000 

1 

130,000 

Component  55 

110,000 

4 

440,000 

Total  Cost: 

$10,278,000 

The  priority  scheduling  rule  used  throughout  this  paper  here  is  the  NTINV2-DBL  rule 
from  [4].  This  scheduling  rule  gives  the  highest  priority  to  the  component  with  the  lowest 
value  of  net  inventory,  which  includes  the  number  of  components  of  the  same  type  further 
along  in  the  repair  process.  This  rule  outperformed  all  other  rules  examined  in  [4].  Note  that 
under  this  (or  a  similar)  rule,  repair  priorities  and  therefore  repair  times  are  directly  affected  by 
the  number  of  component  spares  of  each  type  stocked. 

4.  HEURISTICS  FOR  IMPROVED  SPARES  VECTORS 

In  this  section  we  begin  with  the  initial  spares  vector  described  above  and  attempt  to 
obtain  an  approximate  'efficient  frontier”  trading  off  reduced  investment  in  inventory  against 
higher  engine  backorders.  To  achieve  this  objective,  spares  are  removed,  one  at  a  time,  from 
the  original  spares  vector  and  each  resulting  spares  vector  is  simulated. 
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To  determine  analytically  the  mean  delay  days  resulting  from  each  potential  removal  of  a 
spare,  it  is  necessary  to  have  independent  repair  times.  Since  repair  times  in  this  model  are 
dependent,  the  exact  changes  in  mean  delay  days  cannot  be  determined  analytically.  Therefore, 
it  is  necessary  to  develop  a  heuristic  index  which  will  be  used  to  determine  a  removal  sequence 
to  be  applied  to  the  original  spares  vector.  Six  different  indices  were  developed  for  this  purpose 
(see  Table  3).  A  detailed  description  of  each  follows. 


TABLE  3  —  Definition  of  Heuristic  Indices 


Index  Symbol 

Definition 

h 

(CS)/X 

h 

(CS)/(X  W) 

h 

(CS)/(XOPS)-/,/OPS 

1 4 

S/(X  W) 

/s 

(C2S)/(X  W) 

h 

CIS/(X  HOI2 

where: 

C  “  unit  cost  of  the  component  or  module 
S  —  number  of  spares  of  component  or  module  stocked 
X  “  daily  failure  rate  of  component  or  module 
W  —  total  work  content  of  component  or  module 
(processing  time  and  estimated  queue  time) 

OPS  —  number  of  operations  required  to  return  the  component 
to  a  serviceable  state. 

Index  one  (/()  is  the  ratio  of  the  unit  component  cost  times  the  current  number  of  spares 
divided  by  the  daily  failure  rate.  Thus  the  index  equals  current  investment  in  component 
spares  inventory  divided  by  the  failure  rate,  or  S-days  of  investment  per  failure.  The  intuitive 
motivation  for  this  index  is  that  both  unit  cost  and  the  current  spares  level  should  be  directly 
related  to  a  willingness  to  delete  a  spare  unit,  while  the  failure  rate  should  be  inversely  related. 
The  specific  form  of  the  index  argues  (heuristically,  with  no  optimization  claims)  that  $-days  of 
inventory  investment  per  failure  should  tend  to  be  equalized  across  items. 

The  index  is  calculated  for  each  of  the  modules  and  components  and  then  ranked  from 
highest  to  lowest  (as  are  all  subsequent  indices).  One  spare  of  the  item  with  the  largest  index 
value  is  removed  from  the  current  spares  vector.  As  stated  above,  this  results  in  the  removal 
of  a  unit  having  .either  a  relatively  high  unit  cost,  a  relatively  high  current  number  of  spares,  or 
a  relatively  low  failure  rate. 

Index  two  (/2)  is  l\  divided  by  the  total  work  content  (total  expected  shop  processing 
time,  including  estimated  queue  time)  for  each  component  type.  Here  total  work  content  is 
used  as  a  surrogate  for  total  repair  time.  Application  of  this  index  to  modules  and  engines  is 
ambiguous  since  total  work  content  for  them  is  not  well-defined.  The  actual  work  required  for 
each  of  these  is  merely  assembly,  but  there  may  be  a  delay  waiting  for  a  lower-indenture-level 
part  which  increases  the  total  expected  processing  time.  Since  there  is  only  a  single  component 
failure  in  each  module,  the  module  work  content  is  estimated  by  adding  the  actual  module 
assembly  time  to  a  weighted  average  of  the  total  expected  processing  time  for  each  of  the 
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subordinate  components.*  A  similar  procedure  is  used  to  calculate  the  average  work  content 
for  an  engine.  Thus,  at  all  three  levels,  total  work  content  is  a  surrogate  measure  for  actual 
repair  time.  Since  the  actual  repair  time  for  two  identical  units  may  be  quite  different  with 
priority  scheduling,  the  work  content  measure  may  not  be  accurate. 

The  /2  index  also  lends  itself  to  an  intuitive  interpretation.  Since  X  is  the  expected  daily 
failure  rate,  5/X  equals  the  expected  days  of  coverage  of  demand  that  S  spares  provide  for  a 
given  unit.  Division  by  W  expresses  this  coverage  in  units  of  work  content.  Thus,  S/X  x  W  is 
a  dimensionless  value  which  weights  the  cost  of  each  item  by  its  approximate  demand  coverage 
given  the  current  spares  vector.  If  two  items  have  equal  coverage  (in  units  of  work  content), 
the  more  expensive  item  is  removed  first  from  the  spares  vector. 

Index  three  (/3)  follows  the  same  logic  as  /2 ,  but  uses  the  number  of  operations  required 
as  the  surrogate  for  repair  times.  This  index  assumes  a  monotonic  relationship  between  the 
number  of  operations  and  the  total  repair  time  required.  (In  reality,  there  may  be  a  positive 
correlation  between  C,  the  cost  of  the  item,  and  the  number  of  operations  required.  In  this 
model,  these  values  have  been  generated  independent As  the  number  of  operations 
increases,  the  amount  of  queuing  also  tends  to  increase.  Therefore,  total  expected  repair  time 
can  also  be  expected  to  increase  since  it  is  the  sum  of  queuing  time  and  processing  time. 

The  fourth  index  (/4)  is  similar  to  /2,  but  it  ignores  the  cost  of  the  unit.  This  index 
removes  items  with  relatively  low  failure  rates  or  total  work  content,  regardless  of  their  cost. 

Index  five  (/5)  and  Index  six  (/«)  are  extensions  of  I2.  Index  five  gives  more  weight  to 
unit  cost  by  squaring  it.t  This  results  in  the  removal  of  more  higher-cost  items  earlier  in  the 
removal  process  than  if  /2  were  used.  For  a  given  budget  level,  the  total  number  of  spares 
stocked  over  all  items  tends  to  be  slightly  higher,  compared  to  /2. 

Index  six  assigns  an  exponent  of  2  to  the  coverage  value:  (5/X  x  tV)2.  This  is  a  method 
for  examining  the  effect  of  removing  more  of  the  items  with  better  coverage  earlier  in  the 
removal  process.* 

Results 

Each  index  was  used  to  remove  spares,  one  at  a  time,  from  the  original  spares  vector. 
The  resulting  spares  vector  was  then  simulated.  The  resulting  mean  delay  day  values  for  each 
of  the  indices  are  shown  in  Figure  2  for  various  budget  levels.  (Each  observation  point 
represents  the  outcome  of  one  of  these  simulations.)  Index  S,  similar  to  Index  2  but  with  the 
unit  cost  squared,  provides  a  different  spares  removal  pattern,  but  produces  results  similar  to 
those  of  Index  2.  Index  1,  which  does  not  consider  work  content,  and  Index  3,  where  the 
number  of  operations  is  used  as  a  measure  of  work  content,  perform  almost  0.S0  days  worse, 
on  average,  when  the  budget  is  decreased  by  $1  million.  Index  4  did  not  show  any  promise  for 
improvement  in  the  first  seven  iterations  and  was  eliminated  from  further  testing.  The  results 

’Since  «  needed  component  will  sometimes  be  available  from  component  inventory,  this  measure  of  module  (and  on¬ 
line)  work  content  will  generally  overestimate  actual  processing  time. 

tPowers  greater  than  2  were  examined,  but  with  the  high  cost  values  in  this  problem,  the  Cx  term  dominates  and  the 
index  degenerates  into  one  using  only  unit  cost  as  the  removal  criterion. 

♦Powers  of  2  and  4  were  examined,  but  the  resulting  spares  vectors  have  considerable  overlap.  Only  results  concerning 
the  power  of  2  are  presented  here. 
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Figure  2.  Mean  delay  days  for  five  indices  for  various  budget  levels. 
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for  Index  6  are  not  shown  in  Figure  2  due  to  their  similarity  to  other  results;  at  S9.6  million  the 
results  coincide  with  those  of  Index  2,  and  below  that  budget  level.  Index  6  coincides  with 
Index  4  in  several  cases. 

Based  on  Figure  2’s  results,  Index  2  and  Index  5  seem  to  be  the  best  indices  of  those 
examined  here  for  this  problem. 

5.  IMPROVED  PERFORMANCE  WITH  THE  SAME  BUDGET 

Since  the  theoretical  model  used  to  determine  the  initial  spares  vector  requires  indepen¬ 
dent  repair  times,  it  seems  likely  that  improved  delay-day  performance  could  be  possible  for  the 
same  budget  of  S10.3  million.  Two  different  approaches  are  used  to  explore  this  issue. 

Spares  Convergence  Using  Muckstadt's  Approximation 

One  approach  for  testing  whether  improved  delay-day  performance  can  be  obtained  is  to 
use  Muckstadt's  Approximation  in  a  manner  for  which  it  was  not  designed,  namely,  iteratively 
with  the  mean  times  obtained  using  a  dynamic  scheduling  rule.  (Note  that  the  initial  spares 
vector  is  based  on  the  mean  repair  times  obtained  using  the  FCFS  scheduling  rule.)  Thus  the 
original  spares  vector  of  Table  2  was  resimulated  using  the  NTINV2-DBL  priority  scheduling 
rule.  The  resulting  (different)  set  of  mean  repair  times  was  then  used  to  calculate  a  new  spares 
vector  using  Muckstadt's  Approximation  with  enumeration  as  described  earlier.  This  new 
spares  vector  was  then  simulated  (using  the  NTINV2-DBL  rule)  and  the  entire  process  repeated 
until  the  same  spares  vector  was  obtained  in  two  successive  iterations.  Although  the  mean 
repair  time  for  any  component  will  vary  on  successive  runs  since  the  scheduling  rule  being  used 
is  dependent  on  the  number  of  spares  stocked,  at  some  iteration  the  variations  may  not  be 
sufficient  to  alter  the  spares  vector.  Convergence  is  defined  to  occur  when  two  successive 
spares  vectors  are  identical. 

In  Table  4,  the  results  of  applying  this  procedure  are  shown.  S0  is  the  initial  spares  vector 
shown  in  Table  2.  S4  is  the  final  spares  vector  that  is  obtained  when  either  the  R*  or  the  R$ 
repair  times  are  used  as  input  to  the  Muckstadt  procedure.* 

There  are  two  major  observations  to  be  made  from  Table  4.  First,  the  mean  delay-day 
value  to  which  this  procedure  converges  (3.88)  is  actually  worse  than  the  original  value  (3.80). 
While  this  is  unlikely  to  be  a  statistically  significant  difference,  it  does  show  that  the  use  of  the 
FCFS  mean  repair  times  in  calculating  the  original  spares  vector  provides  relatively  good  perfor¬ 
mance. 

The  second  observation  relates  to  the  apparent  paradox  which  exists  when  attempting  to 
use  any  procedure  based  on  independent  repair  times  when  repair  times  are  actually  dependent 
and  are  at  least  partially  based  on  perceived  need.  The  NTINV2-DBL  priority  scheduling  rule  is 
based  directly  on  spare  inventory  levels.  As  the  number  of  spares  of  a  component  increases, 
the  component  receives  relatively  lower  priorities  in  the  repair  shop.  These  in  turn  lead  to 
longer  observed  repair  times  (i.e.,  with  more  spares,  net  inventory  is  higher  and  module  needs 


values  for  engines  and  modules  are  calculated  in  the  model  and  thus  are  not  required  as  input  data  for  this  conveT' 
lence  testing. 
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TABLE  4  —  Convergence  of  Spares  Vectors  Using  Muckstadt's  Approximation  Iteratively 


Iteration 

0 

1 

2 

3 

4 

5 

Item 

El 

Si 

*3 

SL 

R* 

El 

mm 

rm 

Engine 

0 

— 

0 
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0 

0 

— 

0 

— 

■fl 

Module  1 

2 

— 

1 

— 

1 

— 

1 

— 

1 

— 

l 

Module  2 

1 

— 

1 

— 

2 

— 

1 

— 

1 

— 

l 

Module  3 

2 

— 

2 

— 

2 

— 

2 

— 

2 

— 

Module  4 

2 

— 

2 

— 

2 

— 

2 

— 

2 

— 

Module  S 

2 

— 

1 

— 

1 

— 

1 

— 

1 

— 

l 

Component  11 

2 

32.9 

2 

27.4 

2 

28.4 

2 

27.5 

2 

27.3 

2 

Component  12 

3 

112.8 

4 

119.7 

4 

117.3 

4 

113.7 

4 

111.5 

Component  13 

2 

37.2 

2 

33.7 

2 

32.9 

2 

33.0 

2 

33.5 

Component  14 

3 

111.1 

4 

117.2 

4 

113.8 

4 

111.9 

4 

111.7 

Component  21 

2 

38.8 

2 

32.4 

1 

23.6 

1 

23.2 

1 

23.8 

Component  22 

4 

121.5 

5 

133.0 

6 

159.4 

7 

187.2 

8 

214.0 

Component  23 

0 

21.8 

0 

22.1 

0 

21.6 

0 

21.8 

0 

22.8 

Component  31 

1 

29.4 

1 

24.5 

1 

26.2 

1 

24.2 

1 

24.4 

Component  32 

1 

41.7 

1 

33.7 

0 

33.4 

0 

30.6 

0 

32.6 

o 

Component  33 

2 

44.9 

2 

39.3 

2 

37.2 

2 

36.7 

2 

36.2 

Component  34 

2 

36.8 

3 

38.2 

2 

27.6 

2 

26.6 

2 

25.9 

Component  35 

2 

45.8 

3 

50.8 

3 

49.9 

3 

48.5 

3 

47.4 

Component  36 

3 

49.5 

3 

41.6 

3 

40.0 

3 

39.9 

3 

38.0 

Component  37 

1 

24.7 

1 

22.3 

1 

22.6 

1 

21.5 

1 

21.6 

1 

Component  41 

1 

21.9 

1 

18.5 

1 

17.6 

1 

16.3 

1 

16.9 

1 

Component  42 

4 

57.7 

5 

64.7 

5 

64.0 

5 

62.3 

5 

61.2 

Component  43 

4 

104.3 

5 

109.9 

5 

110.3 

6 

130.3 

6 

128.0 

Component  44 

4 

48.4 

5 

56.3 

5 

55.4 

5 

54.6 

5 

53.8 

Component  51 

2 

25.0 

2 

19.9 

2 

20.3 

2 

19.0 

2 

19.0 

Component  52 

2 

39.0 

2 

33.8 

2 

31.6 

2 

30.3 

2 

29.8 

Component  53 

3 

63.7 

4 

69.9 

4 

71.0 

4 

69.4 

4 

68.3 

Component  54 

1 

29.7 

1 

27.1 

1 

28.0 

1 

27.3 

1 

27.3 

1 

Component  55 

4 

129.1 

5 

140.5 

5 

142.4 

5 

135.2 

5 

134.0 

5 

Mean 

Delay  Days 

3.80 

3.88 

3.85 

3.91 

3.88 

3.88 

for  components  are  satisfied  more  frequently  from  stock).  A  procedure  like  Muckstadt’s  which 
relies  on  the  independence  of  these  repair  times  will  purchase  additional  spares  of  an  item  if  the 
repair  time  increases  significantly.  Therefore,  a  feedback  loop  is  created  in  which  more  spares 
are  purchased,  lower  priorities  and  longer  repair  times  result,  and  possibly  more  spares  are  pur¬ 
chased. 
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In  Table  4  this  behavior  is  observed  for  component  22.  The  number  of  spares  for  this 
component  is  increased  by  one  in  each  iteration  and  the  mean  repair  time  increases  each  time 
as  well.  Since  the  marginal  decrease  in  expected  backorders  is  diminishing  as  each  additional 
spare  is  added,  the  procedure  eventually  converges  to  the  S4  spares  vector.* 

Index  Equalization 

A  second  method  which  was  examined  to  determine  a  better  spares  vector  for  a  given 
budget  is  Index  Equalization.  Index  Equalization  is  a  process  where  the  number  of  spares  of 
each  component  type  are  set  such  that  the  index  number  values  (as  defined  in  Table  3)  for  all 
components  are  approximately  equal.  For  example,  if  Index  2  (CS)/(kW)  was  being  used, 
C/(\  W)  would  be  calculated  for  each  component.  (This  value  does  not  vary  with  the  number 
of  spares) .  Then  the  number  of  spares,  S,  is  increased  for  each  component  until  Index  2  value 
is  as  close  as  possible  to  some  predetermined  value.  (Due  to  the  integrality  of  the  spares  stock¬ 
ing  levels  allowed,  it  is  not  possible  for  all  index  values  to  be  precisely  equal.) 

Since  Index  2  and  Index  5  provided  similar  performance  in  spares  reduction,  they  were 
selected  for  initial  Index  Equalization  testing.  Each  index  was  used  to  calculate  component 
spares  for  budgets  of  $10.3  million  (the  original  budget),  $9  million  and  $8  million.  A  particu¬ 
lar  budget  level  is  obtained  by  varying  the  predetermined  target  index  equalization  value  until 
enough  spares  are  stocked  to  match  the  desired  budget.  For  example,  if  an  index  value  of  500 
results  in  a  budget  of  $7  million  being  used,  increasing  the  index  value  to  600  will  increase  the 
budget  used. 

The  resulting  spares  vectors  were  simulated  using  the  NTINV2-DBL  rule  with  the  result¬ 
ing  mean  delay-day  values  shown  in  Figure  3.  (For  these  tests,  the  engine  and  module  spares 
remained  at  the  levels  shown  in  Table  2.)  Index  2  performed  slightly  better  than  Index  5  at 
budgets  of  $8  million  and  $10.3  million,  while  at  $9  million  they  were  essentially  equal. 
Although  both  index  equalization  procedures  provided  marginally  improved  performance  over 
the  original  spares  vector,  it  should  be  noted  that  Muckstadt's  Approximation  performs  almost 
as  well,  even  though  it  assumes  the  dependent  repair  times  from  the  FCFS  scheduling  rule  are 
independent.  Index  2  was  selected  for  further  testing  and  is  used  for  all  remaining  results  in 
this  section. 

When  using  Index  Equalization  there  are  two  issues  of  importance  related  to  the  accuracy 
of  the  parameters  being  used  in  calculating  the  indices.  These  are  the  estimation  of  repair 
times  by  work  content  and  the  lack  of  a  good  measure  of  work  content  for  modules  and 
engines.  The  first  issue  was  discussed  when  the  indices  were  defined.  The  work  content  for 
modules  and  engines  clearly  must  include  the  actual  assembly  times  (2.5  days  for  engines,  5.0 
days  for  all  modules).  However,  these  alone  are  not  adequate  to  reflect  relative  repair  times, 
since  repair  times  of  modules  (or  engines)  may  also  consist  of  waiting  time  while  the  proper 
components  (or  modules)  are  repaired. 

As  stated  above,  one  possible  procedure  is  to  estimate  module  work  content  as  the  S-day 
assembly  time  plus  a  weighted  average  of  the  work  content  of  the  components  comprising  the 
module.  For  example,  if  a  module  consists  of  only  2  components  with  relative  failure  rates  of 


•The  conversance  procedure  wu  tested  for  edditionel  scheduling  rules  with  similar  results.  Convergence  in  all  cases 
was  achieved  in  S  or  fewer  iterations.  In  other  situations  convergence  may  not  occur;  cycling  may  take  place.  The  con¬ 
vergence  question  is  moot  since  the  performance  of  this  approach  is  actually  degraded. 
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0.25  and  0.75  and  work  content  of  2  days  and  4  days,  respectively,  the  module  work  content 
would  be  estimated  as  5  +  (0.25)  (2)  +  (0.75)  (4)  —  8.5  days.  This  value  overstates  the 
module  work  content  at  least  25%  of  the  time  (i.e.,  when  the  first  component  fails).  Also, 
when  there  is  a  positive  on-hand  inventory  of  the  required  component,  the  actual  repair  time  is 
only  5  days.  If  the  system  does  stock  spares  of  the  component,  the  module  demand  may  also 
be  filled  by  a  component  which  has  already  completed  one  or  more  operations.  Clearly,  actual 
module  repair  times  are  a  function  of  the  number  of  component  spares  and  a  surrogate  meas¬ 
ure,  such  as  work  content,  is  not  easily  calculated.  For  these  reasons,  another  approach  will  be 
examined  below. 

Since  Muckstadt’s  Approximation  appeared  to  provide  a  good  spares  vector  for  the  $10.3 
million  budget,  it  might  also  be  reasonably  good  at  other  budget  levels.  Thus  the  approxima¬ 
tion  was  used  to  calculate  the  spares  vectors  for  budgets  of  $8  millions,  $8.5  million,  $9  mil¬ 
lion,  and  $9.5  million  using  the  FCFS  mean  repair  times.  Index  2  equalization  was  also  used  to 
calculate  spares  vectors  for  these  four  budget  levels,  but  here  the  Index  Equalization  technique 
was  restricted  to  have  constant  module  spares,  while  Muckstadt’s  method  applied  to  modules  as 
well  as  components  decreases  total  module  spares  by  one  for  each  budget  decrease.  These 
spares  vectors  were  simulated  and  the  resulting  mean  delay  days  are  plotted  in  Figure  4.  With 
fixed  module  spares.  Index  Equalization  performance  degrades  relative  to  Muckstadt’s  Approxi¬ 
mation  as  the  budget  is  dec'eased.  This  is  due  to  overspending  on  modules  at  the  lower  budget 
levels. 

As  an  alternative.  Index  Equalization  was  used  for  components  only,  while  the  module 
spares  were  those  generated  by  Muckstadt’s  Approximation.  When  simulated,  these  spares  vec¬ 
tors  resulted  in  the  delay-day  values  plotted  in  Figure  5.  In  this  case.  Index  Equalization  pro¬ 
vides  better  performance  in  all  cases.  Therefore,  it  appears  that  a  good  procedure  is  to  use 
Muckstadt’s  model  with  enumeration  to  determine  the  upper  level  spares  (engines  and 
modules)  and  Index  Equalization  to  allocate  the  remaining  budget  to  the  purchase  of  spare 
components.* 

6,  SENSITIVITY  ANALYSIS  OF  BUDGET  ALLOCATION  TO  ENGINE, 

MODULE,  AND  COMPONENT  SPARES  LEVELS 

When  repair  times  are  independent,  an  analytical  model  like  that  of  Muckstadt  will  deter¬ 
mine  an  optimal  allocation  of  the  available  budget  between  engine,  module  and  component 
spares,  as  well  as  optimal  allocations  of  spares  within  each  level.  It  is  instructive  to  explore  the 
sensitivity  of  spares  vector  performance  to  deviations  from  the  optimal  allocation  of  the  budget 
among  the  three  levels  (whether  repair  times  are  independent  or  dependent)  while  allocating 
appropriately  within  each  level.  This  sensitivity  was  explored  for  the  dependent  repair  times 
which  exist  in  this  model. 

In  this  portion  of  the  study,  engine  and  module  spares  were  varied  one  at  a  time  and  com¬ 
ponent  spares  for  the  remaining  budget  were  calculated  using  Index  2  equalization  as  previously 
described.  A  second  index  number  was  used  to  calculate  the  desired  module  vector  for  each 


•Since  module  and  engine  spare  levels  are  based  on  an  expected  resupply  time  which  consists  of  a  constant  assembly 
time  plus  the  lower  level  expected  delay.  Muckstadt's  method  (which  assumes  independent  repair  times)  should  pro¬ 
vide  reasonable  performance  when  the  assembly  time  is  a  major  percentage  of  the  overall  expected  resupply  time. 
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Budget  (Millions) 

Figure  4.  Comparison  of  Muckstadt's  Approximation  and  Index  2 
equalization  with  fixed  module  spares. 
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iteration.  This  index  is  based  on  the  ratio  of  the  cost  (C)  and  overall  failure  rate  (A)  for  each 
module.  (These  values  are  given  in  Table  2  and  Table  1  respectively.)  Specifically,  this  ratio 
was  used  to  determine  which  module  spare  should  be  the  next  to  add  when  varying  the  module 
spares  vector  upward  from  zero.  For  example,  if  CV'Ai  —  3  and  C-J\i  —  4,  module  1  would  be 
added  before  module  2.  Engine  spares  were  initially  zero  (the  value  used  in  all  previous  simu¬ 
lations);  engine  spare  levels  of  one  and  two  were  also  examined. 

Several  spares  vectors  were  generated  for  each  of  the  engine  spares  levels.  Each  vector 
was  simulated  using  the  NTINV2-DBL  scheduling  rule  and  the  resulting  mean  delay  days  are 
plotted  in  Figure  6.  Each  point  in  the  upper  half  of  Figure  6  represents  the  observed  mean 
delay  days  for  0,  1,  or  2  engines,  with  the  remaining  budget  allocated  to  components  and 
modules  as  shown  in  the  lower  quadrant.  (The  lines  in  the  lower  quadrant  are  budget  lines; 
addition  of  1  engine  shifts  the  line  towards  the  origin  by  $2  million.) 

The  best  performance  (3.61  delay  days)  occurs  when  one  engine  is  stocked  and  approxi¬ 
mately  $5.4  million  is  spent  on  component  spares.  However,  this  is  only  slightly  better  than 
two  of  the  spares  vectors  with  no  engines  (3.62  days  and  3.63  days)  and  the  next  spares  vector 
with  one  engine  (3.65  days)  which  spends  $5.9  million  on  components.  The  original  Muckstadt 
module  spares,  with  Index  2  equalized  component  spares,  performs  almost  as  well  (3.67  days). 
Thus,  there  are  five  spares  vectors,  each  with  component  spares  budgets  in  a  range  of  $5  mil¬ 
lion  to  $6  million,  which  result  in  nearly  identical  performance.  As  more  money  is  allocated  to 
component  spares,  the  performance  with  zero  engine  spares  degraded  much  more  slowly  than 
when  one  engine  is  stocked.  When  less  than  $5  million  is  spent  on  components,  these  two 
engine  stocking  levels  provide  similar  results  over  the  range  examined.  Stocking  two  engine 
spares  clearly  degrades  performance,  with  minimum  mean  delay  days  of  5.25,  more  than  1.5 
days  worse  than  the  minimum  with  one  engine. 

CONCLUSIONS 

In  this  article  we  have  examined  different  methods  of  determining  a  desirable  spares 
inventory  choice  when  repair  times  are  dependent.  Muckstadt’s  Approximation  was  used  itera¬ 
tively  with  the  dependent  repair  times  obtained  using  a  dynamic  priority  scheduling  rule  until 
convergence  to  a  single  spares  vector  was  obtained.  However,  no  improved  spares  vector  was 
generated  at  any  of  the  five  iterations.  Index  Equalization  was  applied  to  the  component  spares 
vector  and  performance  was  improved  by  0.13  days,  on  average  —  only  a  marginal  improve¬ 
ment.  Using  Muckstadt's  Approximation,  spares  vectors  were  generated  for  various  budget  lev¬ 
els.  The  engine  and  module  spares  levels  generated  were  used,  but  Index  Equalization  was 
used  to  determine  the  component  spares  levels.  The  equalized  spares  vectors  provided  some¬ 
what  better  performance  than  Muckstadt  alone  in  all  cases.  However,  it  was  noted  that 
Muckstadt’s  model  appears  to  be  very  robust,  even  with  dependent  repair  times.  The  robust¬ 
ness  of  the  METRIC  model  when  expediting  was  used  has  been  shown  by  Sherbrooke  [12]. 
The  results  presented  here  also  support  this  conclusion. 

Finally,  it  was  shown  that  several  equally  good  spares  vectors  exist  for  this  problem. 
While  engine  and  module  spares  for  these  better  vectors  vary  widely,  the  component  budgets 
lie  in  a  narrower  range  (5  to  6  million  dollars).  Thus,  the  main  tradeoff  flexibility  for  the  be  > r 
spares  vectors  occurs  at  the  engine  and  module  levels  when  appropriate  within-level  allocations 
are  made. 
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ABSTRACT 

It  is  shown  that  Krakowski's  relevation  transform  generates  the  nonhomo- 
geneous  Poisson  process  in  an  analogous  fashion  to  the  way  in  which  Slieltjes 
convolution  generates  the  renewal  process.  Properties  of  failure  rates  and  of 
inter-failure  times  are  discussed  and  an  application  to  warranty  analysis  is 
described. 


I.  INTRODUCTION 

The  (Stieltjes)  convolution  of  two  distribution  functions  with  support  on  the  nonnegative 
axis,  Fand  Gsay,  is  denoted 

F  •  G(t)  -  fo  F(t  -  u)g(u)  du 

if  g  -  G'  exists.  It  is  the  distribution  function  of  the  time  to  failure  of  the  second  of  two  com* 
ponents  when  the  second  component  (with  life  distribution  G)  is  placed  in  service  on  the 
failure  of  the  first  (with  life  distribution  f)\  the  replacement  component  is  assumed  to  be  new 
on  installation.  Suppose,  however,  that  we  replace  the  failed  component  by  one  of  equal  age. 
The  survivor jfunction  of  the  time  to  system  failure  (i.e.,  both  components  are  failed)  is  the 
relevation  of  F  and  G,  the  survivor  functions  of  the  first  and  second  components,  respectively, 
and  is  denoted 

F  #  GO)  -  FO)  +  /0'  f(u)  du 

assuming  that  /  «■  F'  exists.  (The  assumption  that  /  and  g  exist  is  not  necessary,  but  is  typi¬ 
cally  reasonable  in  practice  and  simplifies  the  presentation.)  This  concept  was  introduced  by 
Krakowski  [11]  who  studied  some  properties  of  what  he  calls  the  relevation  transform.  In  this 
paper  we  consider  the  stochastic  process  generated  by  the  successive  failures  of  a  component 
which,  on  failure,  is  replaced  by  a  component  of  equal  age  or,  equivalently,  in  which  the  com¬ 
ponent  is  instantaneously  restored  to  its  condition  immediately  prior  to  failure.  The  latter 
interpretation  provides  us  with  a  model  of  the  failure  pattern  of  a  component  for  which  the 
repair  time  is  negligible  in  comparison  with  the  failure  time,  which  may  be  more  realistic  than 
the  renewal  process  in  certain  instances.  A  similar  model  is  described  by  Ascher  [1];  see  also 
Thompson  (141. 

Further  developments  of  the  relevation  transform  are  discussed  by  Johnson  and  Kotz  [9]-, 
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2.  THE  TIME  TO  THE  nth  FAILURE 


Let  F(„)  denote  the  survivor  function  of  X„,  the  time  to  the  nth  failure.  Clearly, 

1  -  UU)  n-Q 

FU)  n-  1  , 

F(„_,>  #  F(f)  n>2 

where  F  is  the  life  distribution  of  a  new  component  and  U  is  the  Heaviside  function.  We  can 
interpret  FM(t)  as  the  survivor  function  of 

(a)  the  time  to  the  nth  failure  given  that  a  failed  component  is  replaced  by  one  of 
equal  age 

(b)  the  time  to  the  nth  failure  given  that  instantaneous  repair  to  the  condition 
immediately  prior  to  failure  is  performed  at  each  breakdown 


FMU) 


(c)  the  time  to  the  nth  failure  of  a  system  ior  which  there  is  an  arbitrarily  large 
number  of  spares  on  hot  standby. 


An  expression  for  Fin)U)  follows  on  observing  that 

_  n-1 

F(/l)(f)  -  £  P I*  failures  in  (0,  d) 
*-o 


and  hence 


(1) 


F{n ,(f)  -  FU)  + 


I 


r'  /(ui) 
Jo  F(«,) 


r'  /(«2> 
f(u2) 


/(«*) 

F(uk) 


FU) 


x  duk  . . .  du2du ( 

for  0  —  u0  <  “i  <  U'i  <...<«*<  r.  Define  A(x>  -  f(x)/F(x)  (the  hazard  function) 
where  /  -  F'  and 

A(x)-J^  A(u)  du  -  -  log  F(x) 

(the  cumulative  hazard  function).  After  some  straightforward  but  lengthy  algebra,  (1) 
simplifies  to 

(2)  ^,„>(f)  “  FU)  2;  lA(f))V*t 

*-o 


Krakowski  (11]  proves  (2)  by  means  of  an  induction  argument. 


If  p(r)  denotes  the  number  of  failures  in  (0,  r]. 


pU(r)  -  ») 


f(r)[A(r)]"  (fl 


0,  1,2.  ...), 


i.e.,  (v(r).  t  >  0)  forms  a  nonhomogeneous  Poisson  process  with  intensity  function  A (r). 
(This  result  also  follows  by  making  an  obvious  transformation  of  the  time  scale  of  a  Poisson 
process  and  is  noted  by  Barlow,  Proschan  and  Scheuer  [3];  see  also  Parzen  [13],  page  126.) 


Thus,  we  see  that  the  relevation  transform  generates  the  nonhomogeneous  Poisson  pro¬ 
cess  in  a  manner  analogous  to  the  way  in  which  the  convolution  operation  generates  the 
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renewal  process.  Observe  that  the  sequence  of  failures  generated  by  the  relevation  transform 
constitutes  a  renewal  process  if  and  only  if  F  is  exponential;  this  follows  immediately  from  the 
characterization  theorem  of  Grosswald,  Kotz  and  Johnson  [8]. 


On  differentiating  (2),  we  obtain  the  following  expression  for  /<„)(»),  the  density  of  X„: 
(3)  (n- 1.2,3,  ...), 


defining  /«»(')  to  be  the  Dirac  delta  function.  Substituting 
A  (/)  -  £  IF(/)1*/*. 


*-i 


which  follows  from  the  identity  -log  (1  -  x)  -  £  xk/k,  into  (3),  we  see  that 


/<„,(/)-  £  akkf(t)lFU))k-' 

k~n 

for  suitable  constants  {aj,  i.e.,  /(„)(f )  is  an  infinite  mixture  of  the  densities  of  the  largest 
order  statistics  of  a  random  sample  size  k(k  -  n,  n  +  1,  n  +  2,  . . .)  from  the  population  with 
distribution  function  F. 


On  integrating  (3),  we  obtain  the  well-known  identity 
£{lA(2f)]"}  -  T (n  +  1), 


where  JIT  is  the  random  variable  with  distribution  function  £;  it  follows  from  this  and  (2)  that 
£{£,„,(*)/£(*))  -  n. 


The  joint  distribution  of  X„  and  X„+„  is  readily  found.  Clearly,  the  survivor  function  of 
X„+m  conditional  on  (Jf„  -  /)  is 

_  rf_i  #t-l 

F(m)(r\t)  -  £  (A(t)  -  A(r))‘/fc! 

Fit)  *- o 

and  hence  the  joint  density  of  X„  and  is 

(4)  r)  -  — L'oiVm-  i)r'  /(T)A(r)lA(»)l'”,lA(r)  -  AO)]"'1 

for  0  <  /  <  t  <  <». 


EXAMPLES: 

(i)  Suppose  £is  Weibull  with  unit  scale  parameter,  i.e.,  £(i)  -  1  -  exp  (-ta).  Then 

»na— 1  «-(* 


i.e.,  a  member  of  the  generalized  gamma  family. 

(ii)  Suppose  f  is  Lomax  with  unit  scale  parameter,  i.e..  Fit)  —  1  -  1/(1  +  r)°.  Then 
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«"[10g(l  +  f)]*"1 
(1  +  f)“+l(»  -  1)! 


A  general  formula  for  the  rth  moment  of  X„  is  not  available,  but  expressions  can  be 
found  for  certain  special  cases.  Thus,  for  example,  for  the  Weibull  distribution  with  unit  scale 
parameter, 

E(X')  -  T(n  +  rla)lY(n). 

Note  that,  as  would  be  expected,  E(X\)  -  T(1  +  r/a),  the  rth  moment  of  the  Weibull 
distribution. 

3.  MONOTONE  FAILURE  RATES 

A  probability  distribution  function  is  said  to  be  IFR  (DFR)  if  the  corresponding  hazard 
function  is  nondecreasing  (nonincreasing).  In  this  section  we  consider  the  effects  of  the  releva- 
tion  transform  on  monotone  failure  rates. 


Let 


*<».)(*)  “  /<»)(*)/^<ii)(f) 


denote  the  hazard  function  of  the  distribution  F^U).  Clearly, 


x(2)(r) 


x(r)A(r) 
1  +  A(r) 


and  hence  X<2)(r)  <  X(r)  for  all  t,  irrespective  of  the  functional  form  of  F,  as  would  be 
expected  from  the  definition  of  the  relevation  transform.  More  generally,  it  is  easily  seen  that, 
for  n  -  1,2,3  ...  ,  X(„+i)(r)  <  X(„)(f).  (We  assume  that  F(t)  >  0  V  t  >0.) 


We  now  show  that  the  n-fold  recursive  relevation  of  F  is  IFR  whenever  F  is  IFR  (an 
analogous  result  is  given  by  Krakowski  (111). 


THEOREM  1:  Suppose  Fis  IFR.  Then  F()I)  is  IFR  for  n  -  1,2,3 . 

PROOF:  The  proof  is  by  induction  on  n ;  if  F(„)  is  IFR,  it  is  not  difficult  to  show  that 
X(n+ i)(r  +  8)  -  X(B+l)(f)  >  0  for  all  r  >  0  and  8  >  0.  The  algebra  is  tedious  but  straightfor¬ 
ward  and  is  hence  omitted.  II 


A  similar  result  does  not  hold  for  DFR  distributions;  a  counterexample  is  provided  by  the 
Weibull  distribution  Fit)  -  1  -  exp(-rJ/4).  In  this  case, 

3  tV2 

k(1)U)  “  4(1  +  r3") 

which  is  clearly  not  monotone  nonincreasing. 

For  convolutions,  the  same  is  true,  i.e.,  F^'Ht),  the  rt-fold  recursive  convolution  of  F,  is 
IFR  if  Fis  IFR,  but  DFR  is  not  preserved  under  convolution.  More  generally,  Fj  G  is  IFR  if 
both  Fand  G  are  IFR  ([2],  page  100)  so  it  seems  plausible  by  analogy  that  1  -  F  #  G  is  also 
IFR.  This  is  not,  in  fact,  the  case;  a  further  restriction  is  required,  motivating  the  following 
definition. 
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DEFINITION:  For  distributions  F  and  G,  the  conditional  hazard  function  of  F  given  G  is 
defined  as  fit)/ Git). 

THEOREM  2:_  If  G  is  IFR  and  the  conditional  hazard  function  of  F  given  G  is  nonde¬ 
creasing,  then  1  -  F  #  G  is  IFR.  Similarly^  if  F  is  IFR  and  the  conditional  hazard  function  of 
G  given  Fis  nondecreasing,  then  1  -  G  #  F  is  IFR. 

PROOF:  If  lit)  -  log  F  #  Git),  it  is  easily  seen  that  lit  +  8)  -  lit)  ^  0  for  all 
t  >  0,  S  >  0.  II 

Note  that  we  do  not  require  Fit)  to  be  IFR  in  order  that  1  -  F  #  Git)  be  IFR. 


4.  MARGINAL  DISTRIBUTIONS 


Let  Y„~  X„  -  (n  -  1, 2, 3,  . . .  )  be  the  time  from  the  in  -  l)th  to  the  nth  failures, 
defining  X0  —  0.  Let  B„(t)  and  £„(/)  denote  the  distribution  and  density  functions  of  Y„. 
Clearly, 

<5) 

which,  on  integration  by  parts,  yields  the  following  expression: 

<«  «.<»  -  S"  /<' +  “>  *• 

(A  similar  expression  is  given  by  Parzen  [131,  page  128.)  This  is  the  survivor  function  of  the 
marginal  distribution  of  Y„.  The  conditional  distribution  of  Y„  given  has  survivor  func¬ 
tion 

FU„_,  +  rVFOf,-,) 

as  noted  by  Barlow,  Proschan  and  Scheuer  (3)  and  Crow  [7]. 


(7) 


An  expression  for  p„it)  follows  on  differentiating  (6): 

(A(m)]"-j 


/Mf)  ~S0  fit  +  u)  X(u) 


in  -  2)? 


du 


for  n  ^  2.  On  integrating  (6),  we  see  that 

(«)  ctr.t-f'fu)  m 

If,  for  example.  Fit)  -  1  -  exp  (-  t°),  then 


EiY, 


\  £(»LzJL±  l/a) 

I*  ~  r*/  \  • 


aT(n) 


Observe  that 


£(r.„) -£(r.> -<!-.>  r(a-ir(-^ Ir 


i.e., 


EiY'+x)  <  (>)  F(  Kn)  if  Fis  IFR  (DFR).  This  inequality  also  holds  for  other  monotone 
failure  rate  distributions.  In  fact,  a  stronger  result  may  be  obtained. 
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THEOREM  3:  The  sequence  Y j,  Yj.  Y3,  ...  is  stochastically  nonincreasing  (nondecreas¬ 
ing)  if  F  is  IFR  (DFR). 

PROOF:  Suppose  Fis  IFR.  Then  A(u)  <  X(u  +  r)  for  all  u  >  0  and  t  >  0  and  hence 

«,♦.<'>  -  f~  F(,  +  u)M  ^*2]n  * 

<  f  "  FU  +  u)  kit  +  u)  du  for  all  t 

Jo  ( n  -  1)! 

-  Bn(t )  from(6), 

i.e.,  the  sequence  Y )(  T2,  K3,  ...  is  stochastically  nonincreasing.  Similarly,  if  F  is  DFR,  the 
sequence  is  stochastically  nondecreasing.  II 

5.  APPLICATION  TO  WARRANTY  ANALYSIS 

The  nonhomogeneous  Poisson  process  provides  a  (possibly)  more  realistic  model  of  a 
repairable  machine  than  the  renewal  process  and,  in  certain  instances,  is  mathematically  more 
tractable.  An  important  example  is  the  Blischke-Scheuer  free-replacement  warranty  model  [4], 
[51. 


Suppose  that  the  lifetime  of  an  item  follows  a  distribution  Fwith  mean  n  and  that  an  item 
is  required  by  a  consumer  for  a  length  of  time  L.  On  failure  before  L,  the  item  is  replaced  free 
if  it  fails  during  the  warranty  period  of  length  IP,  otherwise  a  new  item  is  purchased  at  a  cost  K. 
Let 


MU)  -  Jj  F(r)U) 

denote  the  renewal  function  corresponding  to  F,  recalling  that  F*r>  is  the  /'-fold  recursive  convo¬ 
lution  of  F.  Then  the  average  number  of  times  payment  must  be  made  under  the  free  replace¬ 
ment  warranty  is 

1  +  L/{fi[\  +  MiW)]\. 


Blischke  and  Scheuer  [4]  show  that  the  breakeven  cost  at  which  the  customer  would  be 
indifferent  to  buying  the  item  with  or  without  a  warranty  is 

rf  m*[1  +  M(L)]ll  +  M(fV)] 

M[1  +  MiW)]  +  L 

For  the  nonhomogeneous  Poisson  process,  the  corresponding  expression  is 

(9)  c  _  +  Aa)llH'F(IP)  +  Prl 

(IP+  L)F(W)  +  Dw 

where  Dw  -  F(u)  du.  A  sharper  result  can,  however,  be  obtained  in  this  case;  each  failure 
immediately  succeeding  the  elapse  of  the  warranty  is  a  regeneration  point'  in  an  embedded 
renewal  process  with  inter-renewal  times  governed  by  the  distribution  function 


(10) 


Aw(t) 


FU  +  m  -  F(W) 
1  -  F(W) 
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Thus,  the  expected  number  of  payments  under  the  free-replacement  warranty  is 

i+Jj^  (*) 

which  may  readily  be  evaluated  by  means  of  the  Cleroux-McConalogue  cubic  splining  algorithm 
16],  [12],  The  breakeven  cost  at  which  the  consumer  would  be  indifferent  to  purchasing  with 
or  without  a  warranty  is 

(ID  c‘  *H  +  Aq>l 

1  +  TAjfHL) 
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ABSTRACT 

Different  properties  of  the  HNBUE  (HNWUE)  class  of  life  distributions 
(i.e.,  for  which  J  Fix)  dx  ^  (>)  a  expl -i/p)  for  t  >  0,  where  a  " 
Fix)dx)  are  presented.  For  instance  we  characterize  the  HNBUE 
(HNWUE)  property  by  using  the  Laplace  transform  and  present  some  bounds 
on  the  survival  function  of  a  HNBUE  (HNWUE)  life  distribution.  We  also  ex¬ 
amine  whether  the  HNBUE  (HNWUE)  property  is  preserved  under  the  relia¬ 
bility  operations  (i)  formation  of  coherent  structure,  (ii)  convolution  and  (iii) 
mixture.  The  class  of  distributions  with  the  discrete  HNBUE  (discrete 

HNWUE)  property  (i.e..  for  which  £  P;  ^  0)^(1  -  1/m)*  for 

/-* 

*  -  0,  1,  2 . where  u  -  £  and  -  £  pk)  is  also  studied. 

I- o  *->+! 


1.  INTRODUCTION 

In  many  applications  (e  g.,  reliability,  maintenance  theory,  inventory  theory  and  biometry) 
different  forms  of  aging  are  of  interest.  Therefore,  during  the  last  decades  several  classes  of 
life  distributions  based  on  notions  of  aging  have  been  studied.  The  most  well-known  of  these 
classes  are  the  IFR,  IFRA,  NBU,  NBUE  and  DMRL  classes  (with  duals),  the  definitions  of 
which  are  given  in  the  Appendix. 

In  this  partly  expository  paper  we  shall  study  a  class  of  life  distributions  named  HNBUE 
(harmonic  new  better  than  used  in  expectation)  and  its  dual  class  HNWUE  (W  —  worse). 

In  Section  2  we  present  the  HNBUE  (HNWUE)  property  and  a  corresponding  property  for 
discrete  distributions  named  discrete  HNBUE  (discrete  HNWUE). 

In  Section  3  we  give  relationships  between  the  HNBUE  (HNWUE)  property  and  the 
Laplace  transform  and  the  discrete  HNBUE  (discrete  HNWUE)  property  and  the  generating 
function. 

In  Section  4  we  examine  whether  the  HNBUE  (HNWUE)  property  is  preserved  under  the 
reliability  operations:  (i)  formation  of  coherent  structure,  (ii)  convolution  and  (iii)  mixture. 
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In  Section  5  we  prevent  some  bounds  on  the  survival  function  of  a  HNBUE  (HNWUE) 
life  distribution. 

2.  DEFINITIONS 

The  HNBUE  (HNWUE)  Property 

DEFINITION  1:  A  life  distribution  F  and  its  survival  function  F  =  1  —  F  with  finite 

J.  oo  _ 

F(x)dx  are  said  to  be  harmonic  new  better  than  used  in  expectation  (HNBUE) 

0 

if 

(1)  J*  Fix)dx  <  (i  exp ( —  ///la )  for  /  >  0.  □ 

If  the  reversed  inequality  is  true,  Fand  F are  said  to  be  harmonic  new  worse  than  used  in 
expectation  (HNWUE).  This  gives  a  dual  class  to  the  HNBUE  class  of  life  distributions  in  the 
same  way  as  the  IFR,  IFRA,  NBU,  NBUE  and  DMRL  classes  have  duals  (cf.  Appendix). 

The  HNBUE  and  HNWUE  classes  were  introduced  by  Rolski  [17],  The  reason  for  the 
name  HNBUE  is  the  following.  Suppose  for  simplicity  that  Fit)  >  0  for  /  >  0  and  let 

eFit)  =  J  Fix)dx/Fit) 

denote  the  mean  residual  life  of  a  unit  of  age  /.  Then  the  inequality  (I)  can  be  written 

(2)  - - p-J -  <  fi  for  /  Ss  0. 

7/0 

This  inequality  says  that  the  integral  harmonic  mean  value  of  eF(x),  0  ^  x  ^  r,  is  less  than  or 
equal  to  the  integral  harmonic  mean  value  of  e>f(0)  =  n- 

The  HNBUE  property  can  be  characterized  in  several  ways.  One  of  these  is  the  following. 

Let  7>  denote  the  equilibrium  distribution  of  F defined  by 

(3)  Tf(x )  =  y  f*  Fis)ds  for  x  ^  0. 

This  life  distribution  is  of  interest  e  g.,  in  renewal  theory  fsee  e.g.  Feller  f8j.  Chapter  II).  By 
using  7>,  the  HNBUE  property  can  be  written  as 

fF(t)  ^  Git)  for  t  >  0, 

where  7  =  1—  Tf  and  Git)  =  exp i-t/fi),  t  ^  0. 

One  reason  why  we  find  the  HNBUE  (HNWUE)  class  interesting  is  the  following. 

Let  47  with  life  distribution  Fy  and  survival  function  F,,  for  j  —  1,  2 . n,  denote  the 

life  lengths  of  n  independent  units.  If  n ,  and  nr  denote  the  mean  lives  of  their  series  and 
parallel  system,  respectively,  we  have 

(4)  m,-  r 

/-I 
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and 

(5)  Mp=/0“{1- 

j- 1 

The  integrals  in^  (4)  and  (5)  are  jften  difficult  to  calculate.  However,  if  Fj  is  HNBUE  with 

mean  n,  and  Gj(x)  —  *  (~Af//Ay),  x  >  0,  for  j  =  1,  2 . «,  we  have  the  bounds  (see 

Klefsjo  [13]) 

(6)  ns  >  /0°°  {JX  Gj(x))dx 

1 

and 

(7)  U- nGJ(x)lt6c 

y-i 

which  are  simple  to  calculate. 

In  fact  it  follows  from  Theorem  3.2  on  p.  33  in  Barlow  and  Proschan  [3]  that  (6)  and  (7) 

are  true  under  the  weaker  condition  that  f,,  (2 . in  are  associated  and  each  is  HNBUE 

with  mean  nj.  That  $|,  . in  are  associated  means  a  form  of  dependence  (see  Barlow 

and  Proschan  [3],  Chapter  2). 


The  Discrete  HNBUE  (Discrete  HNWUE)  Property 

A  survival  function  which  is  IFR  (DFR)  is  continuous  except  possibly  at  the  right  (left) 
hand  end  point  of  its  interval  of  support  (cf.  Barlow  and  Proschan  [3],  p.  77).  The  correspond¬ 
ing  is  not  true  in  the  HNBUE  (HNWUE)  case.  There  are  discrete  survival  functions  which  are 
HNBUE  (HNWUE) 

We  shall  now  introduce  another  aging  property  which  is  useful  when  we  discuss  discrete 
distributions. 


Let  f  be  a  strictly  positive  integer  valued  random  variable  and  let  Pk  -  P(f  >  k )  denote 
the  corresponding  survival  probabilities.  Let  further  1  ”  (?o  >  Q\  >  (?2  ^  ■  denote  the 

corresponding  survival  probabilities  of  a  geometric  distribution  with  finite  mean 


M 


i.e.,  Qk  -  (1  -  1  /fi)k,  k  -  0,  1,  2 .  Since  the  discrete  counterpart  to  the  exponential  dis¬ 

tribution  is  the  geometric  distribution  it  seems  natural  to  say  that  a  discrete  distribution  with 
survival  probabilities  Pn  >  P\  ^  P2  ^  . . .  is  discrete  HNBUE  if 


I  Pj  <  I  <2,  for  k  -  0.  1.2 . 

j-k  j-k 

This  leads  to  the  following  definition. 
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DEFINITION  2:  A  discrete  distribution  and  its  survival  probabilities  Pk, 

oo  _ 

k  -  0,  1,  2 . with  finite  mean  M  “  X  are  said  t0  be  discrete  harmonic  new  better  than 

k-0 

used  in  expectation  (discrete  HNBUE)  if 

(I* 

1  -  —  for  *  -  0,  1,2 .  □ 

M  ) 


If  the  inequality  (8)  is  reversed,  the  distribution  and  its  survival  probabilities  are  said  to 
be  discrete  harmonic  new  worse  than  used  in  expectation  (discrete  HNWUE). 

The  condition  that  a  distribution  is  discrete  HNBUE  is  stronger  than  the  condition  that 
the  corresponding  survival  function  is  HNBUE.  In  the  HNWUE  case  the  situation  is  just  the 
reverse.  The  situation  for  the  other  aging  properties  mentioned  in  the  Appendix  vary.  (Cf. 
Klefsjo  (13].)  * 

In  Klefsjo  [13]  we  also  provq  that  the  HNBUE  (HNWUE)  class  of  life  distributions  is 
larger  than  the  NBUE  (NWUE)  class  and  that  the  same  is  true  in  the  discrete  case. 

3.  RELATIONSHIPS  WITH  SOME  TRANSFORMS 

The  HNBUE  (HNWUE)  Property  and  the  Laplace  Transform 

Let 

(9)  <t>(s )  -  r  e~sxdF(x)  for  s  >  0 

Jo 

denote  the  Laplace  transform  of  F.  Further  let 

— r^fT^^FUldxforit-O,  1.  2 . 

s  I  »fo  k\ 

and 

(<**+1  (s)  —  s<r+lfl(t(s)  for  k  —  0,  1,  2,  ... 

(11)  |«0(s)  -  1. 

Block  and  Savits  (4]  proved  that  a  life  distribution  F  is  1FR,  IFRA,  NBU,  NBUE,  or 
DMRL  (or  dual)  if  and  only  if  (a*(s))r.o  has  the  corresponding  discrete  property  for  every 
s  >  0. 

The  following  theorem  states  that  the  same  is  true  in  the  HNBUE  and  HNWUE  cases. 

THEOREM  I :  A  life  distribution  F  is  HNBUE  (HNWUE)  if  and  only  if  the  sequence 
(ak(s))?Lo  is  discrete  HNBUE  (discrete  HNWUE)  for  every  s  >  0. 

PROOF:  It  is  sufficient  to  prove  the  theorem  in  the  HNBUE  case.  The  HNWUE  case 
then  follows  by  reversing  all  inequalities. 

The  "only  iF  part  in  the  HNBUE  case  was  proved  by  Klefsjo  [12], 


(10) 


ak(s)- 


(-1)* 


jP_ 

dsk 
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Let  (j,  j  -  1,  2,  ,  &,  be  independent  random  variables  which  are  exponentially  distri¬ 

buted  with  P(£j  >  x)  -  exp  (-sx),  x  >  0.  Further,  let  gk  denote  the  gamma  density  func¬ 
tion  of  jr  (j.  Then  ak(s)  can  be  written  as 

ak(s)  -  J*Q  gk(x)  F(x)dx  for  k  —  1,  2,  3 . 

Now  suppose  that  a  device  is  subjected  to  some  kind  of  shocks  and  interpret  £j  as  the  ran¬ 
dom  damage  caused  by  shock  number  j.  Further,  suppose  that  the  damages  accumulate  addi- 
tively  and  that  the  device  fails  when  the  accumulated  damage  exceeds  a  random  threshold  with 
distribution  function  F. 

Then  ak(s )  is  the  probability  that  the  device  will  survive  the  first  k  shocks.  This  damage 
model  has  been  further  studied  by  Esary,  Marshall  and  Proschan  [7]  and  Klefsjo  [12],  [14], 

The  Discrete  HNBUE  (Discrete  HNWUE)  Property  and  the  Generating  Function 

Now  suppose  that  (pk)“- 1  is  a  discrete  distribution  with  survival  probabilities 

oo  oo  _ 

Pk  -  £  Pj,  k  -  0.  1,  2,  ....  and  finite  mean  p-  YPk-  A  discrete  counterpart  to  the 
>-*+ i  *-5 

Laplace  transform  for  general  survival  functions  is  the  generating  function 

k(z)~  ±PjzJ-  1  -  T  2'(1  -  z)Pj. 

j- 1  J-0 

If  f  is  a  random  variable  which  has  a  geometric  distribution  such  that 
(15)  /»«-;) -r'd-r)  for >-0.  1,  2,  ... 


we  can  write  k(z)  as 


k(z)  -  1  -  T  P(t-j)Pj. 
j- o 

k 

If  *  —  1,  2,  3 . k,  are  independent  and  have  the  distribution  (15)  then  Jj  f,  has  a 

negative  binomial  distribution,  i.e., 

j.  ;  —  1 1 

zJ  (1  -  z)k  for  j-  0,  1,  2,  ... 


i  * 

k+j  -  \ 

j 

(see  Johnson  and  Kotz  [11],  p.  124).  This  means  that  a  discrete  counterpart  to  (a*(j))*L,0  >n 
[11]  is  (with  1  -  z  changed  to  pand  q  —  I  —  p) 


(16) 


(?*(P)-  | 

Q0(P)  ~  I- 


k+j 

j 


pkqJ  Pj  for  k 


1,  2,  3, 
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Suppose  that  a  device  is  subjected  to  two  different  types  of  shocks,  A  and  B,  say.  At 
every  discrete  time  point  a  shock  of  type  A  occurs  with  probability  p  and  a  shock  of  type  B 
occurs  with  probability  q  -  1  -  p.  If  denotes  the  number  of  B-shocks  between  the  A-shocks 
number  v  -  1  and  v,  v  -  1,  2,  3,  . . . ,  then  has  the  geometric  distribution 


/*(£*  -  j)  -  PQj  for  7  -0,  1,  2,  .... 


Further, 


K 

V- 


represents  the  number  of  B-shocks  before  A-shock  number  k  and 


it 


f ,  -  A  - 


k+j-l 

j 


pkqJ  for  7  —  0,  I,  2 . 


If  the  device  has  the  probability  Pj  of  surviving  the  first  j  B-shocks  then  Qk(p)  in  (16)  denotes 
the  probability  that  the  device  will  survive  until  k  shocks  of  type  A  have  occurred. 


THEOREM  2:  If  (Pk)j”o  is  discrete  HNBUE  (discrete  HNWUE)  then  (Qk(p))?.0 
defined  by  (16)  is  discrete  HNBUE  (discrete  HNWUE)  for  0  <  p  <  1. 


PROOF:  Suppose  that  17  is  a  random  variable  the  distribution  of  which  is  a  mixture  of 
Poisson  distributions,  such  that  the  expected  values  9  of  the  Poisson  distributions  vary  accord¬ 
ing  to  a  gamma  distribution  with  probability  density  function 


gp(x)  - 


1  ~  P 


r  (*) 


-i 


x*_1  exp  (-*/»/(  1  -  p))  for  x  >  0. 


Then 


7,(^-7')-/0  gp(x)ex~dx 


k+j-l 

j 


pkqJ 


(cf.  Johnson  and  Kotz  [11],  p.  125).  This  means  that,  for  k  >  1,  Qk(p)  can  be  written  as 
(17)  Qk(p)  “  f0  gp(x)  H(x)dx, 

where 


(18) 


A  comparison  with  (10)  and  (11)  shows  that  Qk(p)  in  (17)  is  of  the  same  form  as  ak(s)  in 
(11)  but  with  s  —  p/(  1  -  p).  From  Theorem  1  above  it  follows  that  (Qk(p))ki o  is  discrete 
HNBUE  (discrete  HNWUE)  if  and  only  if  //in  (18)  is  HNBUE  (HNWUE).  But  Theorem  3.1 
in  Klefsjo  [14]  gives  that //is  HNBUE  (HNWUE).  This  proves  the  theorem.  n 


REMARK:  By  using  Theorem  1.1  in  Block  and  Savits  [4]  and  Theorems  3.1  and  3.2  in 
Esary,  Marshall  and  Proschan  [7]  it  follows  that  Theorem  2  is  true  with  HNBUE  (HNWUE) 
changed  to  IFR  (DFR),  I  FRA  (DFRA),  NBU  (NWU),  NBUE  (NWUE)  or  DMRL  (IMRL). 

4.  PRESERVATION  OF  THE  HNBUE  AND  HNWUE  PROPERTIES  UNDER  SOME 
RELIABILITY  OPERATIONS 

Table  1  shows  to  what  extent  the  HNBUE  and  HNWUE  classes  are  closed  under  some 
usual  reliability  operations.  For  proofs  and  comments  see  Klefsjd  [13].  Note  in  particular  that 
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a  mixture  of  HNWUE  life  distributions  is  HNWUE.  Until  recently  it  was  an  open  question 
whether  the  NWUE  class  had  this  property.  Now  Bondesson  [5]  and  Mehrotra  [16]  indepen¬ 
dently  have  proved  that  the  NWUE  class  is  not  closed  under  mixtures.  We  also  mention  that  a 
mixture  of  HNBUE  life  distributions  all  of  which  have  the  same  mean  is  HNBUE. 


TABLE  1  —  A  System  of  Independent  HNBUE  (HNWUE)  Components  is 
HNBUE  (HNWUE)  According  to  the  Table. 


Formation  of  coherent 
structures 

Convolution  of 
Distributions 

Mixture  of 
Distributions 

HNBUE 

Not  preserved 

Preserved 

Not  preserved 

HNWUE 

Not  preserved 

Not  preserved 

Preserved 

5.  BOUNDS  ON  THE  SURVIVAL  FUNCTION 

In  this  section  we  present  bounds  on  the  survival  function  F when  /"is  a  life  distribution 
which  is  HNBUE  or  HNWUE.  Such  bounds  are  useful  e.g.,  in  reliability  since  in  a  typical 
situation  the  only  fact  known  a  priori  may  be  that  the  time  to  failure  is  HNBUE  (HNWUE) 
with  expectation  p . 

Bounds  of  similar  character  in  the  IFR  (DFR),  IFRA  (DFRA),  NBU  (NWU),  NBUE 
(NWUE)  and  DMRL  (IMRL)  cases  were  given  by  e.g.,  Barlow  and  Marshall  111,  [2],  Marshall 
and  Proschan  [15],  Haines  and  Singpurwalla  [10]  and  Barlow  and  Proschan  [3].  Some  of  these 
bounds  will  be  presented  later  on. 

Bounds  on  Fin  the  HNBUE  Case 


Theorems  3  and  4  contain  bounds  on  a  HNBUE  survival  function  which  are  useful  when 
only  the  mean  p  is  known. 


(19) 


THEOREM  3:  Suppose  that  Fis  a  life  distribution  which  is  HNBUE  with  mean  p.  Then 

for  t  ^  p 

Fit)  <  ' 


1 

exp 

*  ) 

PROOF:  Let  t  >  0.  By  using  the  HNBUE'definition  (1)  we  get  that 

f  F(x)dx  <  j*s  F(x)  dx  ^  p  exp  (-s/p)  for  every  0  <  s  <  t. 


But 

J*s  F(x)dx  >  (r  -  s)  F(t) 
since  F  is  decreasing.  Hence,  we  obtain  that 

FOX  inf  <*  e,P 

0<  j<  r  t  —  S 


1 

exp 


M 


for  t  ^  p 
]  forr  >  p 


□ 
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We  do  not  know  of  any  analogous  upper  bounds  in  the  NBUE,  NBU  and  DMRL  cases. 
However,  in  the  1FRA  case  Barlow  and  Proschan  [3}  gave  the  upper  bound 

fl  for  t  <  n 

(20)  Fit)  <  |exp  foTt  >M. 

where  w  -  wit)  >  0  is  the  positive  solution  to  1  —  wy.  -  exp(— kv).  Calculations  show,  as  is 
expected,  that  our  HNBUE  bound  is  weaker.  The  two  bounds  are  illustrated  in  Figure  1. 


Figure  1.  The  HNBUE  upper  bound  in  (19),  the  IFRA  upper  bound  in  (20),  the 
HNBUE  lower  bound  in  (21)  and  the  NBUE  lower  bound  in  (24). 


(21) 


THEOREM  4:  Suppose  that  Fis  a  life  distribution  which  is  HNBUE  with  mean  p.  Then 
(exp  i-a/fi)  for  0  <  t  <  n 


Fit)  >  o 


for  t  >  ft 

where  a  -  ait)  is  the  largest  non-negative  number  for  which 
(22)  ia  —  t  +  fi)  exp  i—a/n)  —  fi  +  t  *  0. 


REMARK:  Besides  a  -  0  there  is  for  0  <  t  <  n  exactly  one  positive  solution  to  the 
equation  (22). 
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PROOF:  Let  /  >  0.  It  follows  that 

f  Fix)dx  <  t  +  Fit)  ( s  -  t)  for  every  s  >  t. 

J  0 

Furthermore,  the  HNBUE  definition  gives  that 

J"o  Fix)  dx  ^  fii  1  -  exp i-s/n)). 

Accordingly, 

(23)  Fit)  >  U(1  -  exp(-s/^t))  -  r}/(s  -  t)  for  every  s  >  t, 

i.e.  Fit)  >  sup  !(/x(l  -  exp(-s//i))  -  r)/(s  -  t). 

5>  t 

Standard  calculus  then  gives  that  for  /  <  /a  the  supremum  is  attained  for  s  =  a  given  by  (22). 
Since  the  right-hand  side  of  (23)  is  negative  for  every  s  >  r  if  t  >  n  the  proof  is  complete. 


Table  2  gives  the  lower  bound  in  (21)  for  some  values  on  t/n 


TABLE  2  —  Some  Values  of  the  HNBUE  Lower  Bound  in  (21) 


t/M 

lower 

bound 

t /m 

lower 

bound 

t/M 

lower 

bound 

0.00 

1.000 

0.35 

0.509 

0.70 

0.123 

0.05 

0.903 

0.40 

0.446 

0.75 

0.097 

0.10 

0.813 

0.45 

0.334 

0.80 

0.070 

0.15 

0.729 

0.50 

0.285 

0.85 

0.047 

0.20 

0.650 

0.55 

0.239 

0.90 

0.027 

0.25 

0.577 

0.60 

0.198 

0.95 

0.011 

0.30 

0.509 

0.65 

0.161 

1.00 

0.000 

The  bound  in  Theorem  4  can  be  compared  to  the  result 


(24) 


Fit)  > 


in  —  t)/n  for  t  <  fi 


0  for  t  ^  n 

given  by  Marshall  and  Proschan  (15]  in  the  NBUE  case.  Figure  1  illustrates  the  two  bounds. 


Bounds  on  Fin  the  HNWUE  Case 


THEOREM  5:  Suppose  that  Fis  a  life  distribution  which  is  HNWUE  with  mean  n-  Then 
(25)  Fit)  <  ^  (1  -  exp  i-t/n))  for  t  >  0. 

PROOF:  The  HNWUE  definition  gives  that 

J"Q  Fix)dx  <  n  (1  -  exp(-r//i))  for  t  >  0. 
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Since  Fis  decreasing 

f  Fix)dx  >  t  Fit) 

Jo 

and  the  theorem  follows.  O 

Since  the  NWUE  class  of  life  distributions  is  the  largest  of  the  DFR,  DFRA,  NWU, 
NWUE  and  IMRL  classes,  all  of  which  are  contained  in  the  HNWUE  class  (cf.  Appendix),  it 
may  be  of  interest  to  compare  our  HNWUE  bound  to  the  NWUE  bound 

(26)  Fit )  <  — for  t  >  0 

M  +  i 


given  by  Haines  and  Singpurwalla  [10],  The  two  bounds  are  illustrated  in  Figure  2. 


Fkh  re  2  The  HNWUE  upper  bound  in  (25)  and  the  NWUE  upper  bound  in  (26) 

The  only  lower  bound  on  F  we  can  obtain  when  Fis  HNWUE  is  Fit)  >  0.  This  can  be 
seen  by  studying  Fit)  -  e  exp  i-tt/n),  t  >  0,  which  is  DFR  and  hence  HNWUE. 

The  bounds  presented  in  Theorems  3,  4,  and  5  are  all  sharp  (see  Klefsjo  [13)). 

In  Klefsjo  (13)  bounds  are  also  presented  assuming  that  Fit.)  is  known  for  some  u  >  0. 
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APPENDIX 

A  life  distribution  F  (i.e.  a  distribution  function  with  Fi 0-)  -Jpjand  its  survival  function 
F  -  1  -  F  with  support  S  =  {/:  Fit)  >  0|  and  finite  mean  A*  ”  J"0  Fix)dxm  said  to  be  (or 
to  have) 

1.  increasing  failure  rate  (IFR)  if  the  conditional  survival  function 

,  _  Fix  +  t) 

Fit) 

is  a  decreasing  function  of  the  age  /  whenever  x  >  0  and  1 1  S  ', 

2.  increasing  failure  rate  in  average  (IFRA)  if 

t  _  -In  Fit) 
t 

is  increasing  on  S; 

3.  new  better  than  used  (NBU)  if  Fix)Fiy)  ^  Fix  +  >0  for  x  >  0  and  y  ^  0; 

4.  new  better  than  used  in  expectation  (NBUE)  if 

Fix)  f  °°  Fiy)dv  >  f°°  Fix  +  y)dy  for  x  >  0, 

5.  decreasing  mean  residual  life  (DMRL)  if 

t  —  f  ”  Fix)dx 

Fit)  Jl 

is  decreasing  on  S. 

6.  harmonic  new  better  than  used  in  expectation  (HNBUE)  if 

J  Fix)dx  <  fi  exp  i-t/n)  for  every  t  >  0. 

The  relations  between  the  classes  1.-6.  are  the  following: 


FFR 


IFRA 


NBU 


DMRL 


NBUE 


HNBUE 
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By  reversing  the  inequalities  and  the  directions  of  monotonicity  (and  studying  t  >  0)  we 
get  the  six  dual  classes  DFR,  DFRA,  NWU,  NWUE,  IMRL  and  HNWUE,  respectively.  Here 
D  -  decreasing,  1  —  increasing  and  W  -  worse. 

oo 

A  discrete  distribution  and  its  survival  probabilities  Pk  —  £  pJ%  k  —  0,  1,  2,  ...  ,  with 

>-*  +  ! 

_  00 

Po  -  1,  finite  mean  p.  -  £  Pk  and  S  -  { A :  Pk  >  0)  are  said  to  be  (or  to  have) 

*-o 


1.  discrete  increasing  failure  rate  (discrete  1FR)  if  (Pk/Pk~ |)J1,  is  decreasing  on  Si 

2.  discrete  increasing  failure  rate  in  average  (discrete  1FRA)  if  ( Pkk)“-\  is  decreasing; 

3.  discrete  new  better  than  used  (discrete  NBU)  /*  Pk  >  Pj+k  for  j,  k  •»  0,  1,  2,  . . 

4.  discrete  new  better  than  used  in  expectation  (discrete  NBUE)  if 

P„  T  Pj  >  £  P,  for  k-0,  l,  2,  ...; 

j-k 

5.  discrete  decreasing  mean  residual  life  (discrete  DMRL)  if  I  £  Pj\/Pk  is  decreasing  in 

\  J-k  ) 

k-  0.  1.  2,  ...on  Si 

6.  discrete  harmonic  new  better  than  used  in  expectation  (discrete  HNBUE)  if 

T  P,  <  ni  1  -  — )*for  k  -  0,  1,  2 . 

J-k  M 

The  relations  between  these  classes  of  discrete  distributions  are  the  same  as  in  the  general 
case.  We  get  dual  classes  by  Reversing  the  inequalities  and  directions  of  monotonicity  (and 
dropping  the  requirement  that  P0  -  1). 

Different  properties  of  the  five  classes  IFR,  IFRA,  NBU,  NBUE  and  DMRL  and  their 
duals  are  discussed  for  instance  by  Bryson  and  Siddiqui  [6],  Marshall  and  Proschan  [IS],  Haines 
[9]  and  Barlow  and  Proschan  [3]. 
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ABSTRACT 

This  article  examines  the  short  run  total  costs  and  long  run  average  costs  or 
products  under  warranty.  Formulae  for  both  consumer  cost  under  warranty 
and  producer  profit  are  derived  The  results  in  the  case  or  the  pro  rata  warran¬ 
ty  correct  a  mistake  appearing  in  Blischke  and  Scheuer  |5)  We  also  show  that 
expected  average  cost  to  both  the  producer  and  the  consumer  or  a  product 
under  warranty  depends  on  both  the  mean  or  the  product  liretime  distribution 
and  on  its  failure  rate. 


1.  INTRODUCTION 

Manufacturers  use  warranties  to  accomplish  a  variety  of  goals  such  as  product  promotion, 
quality  assurance,  and  consumer  risk  reduction.  In  some  cases  warranties  are  required  by  law; 
in  others  cases,  though  voluntary,  warranties  are  viewed  as  essential  to  the  producer’s  competi¬ 
tive  position.  Consumers,  on  the  other  hand,  must  decide  between  products  with  differing  war¬ 
ranties  or  between  different  warranty  options  for  the  same  product.  The  problem  of  calculating 
the  costs  and  benefits  of  warranty  policies  is  an  important  one. 

There  are  two  common  types  of  warranty  policies:  the  pro  rata  warranty  and  the  free 
replacement  warranty.  Both  warranties  specify  a  period  of  time  during  which  the  manufacturer 
will  share  some  responsibility  for  the  costs  of  repair  or  replacement  of  the  product.  Under  a 
free-replacement  policy  the  producer  pays  the  entire  cost  of  repair  or  replacement  if  the  product 
fails  before  the  warranty  expires  and  supplies  as  many  repairs  or  replacements  as  needed  during 
the  length  of  the  warranty.  If  the  warranty  is  of  a  pro  rata  type,  the  cost  of  the  replacement 
item  depends  on  the  age  of  the  product  at  the  time  of  failure  and  the  replacement  item  is 
covered  by  an  identical  warranty.  We  assume  that  after  each  failure  the  owner  of  the  product 
instantaneously  purchases  an  identical  replacement.  The  cost  of  the  replacement  depends  on 
the  type  of  warranty  and  the  age  of  the  product  at  failure.  The  assumption  of  instantaneous 
replacement  allows  us  to  describe  the  sequehce  of  product  lifetimes  as  a  renewal  process,  and  to 
bring  to  bear  on  the  problem  the  powerful  techniques  of  renewal  theory. 

Recently  Blischke  and  Scheuer  15]  have  approached  the  problem  of  calculating  the  short 
run  costs  and  benefits  to  both  the  buyer  and  the  seller  of  products  covered  by  warranty  policies. 
Blischke  and  Scheuer  examine  both  the  pro  rata  and  free  replacement  warranties  from  the 
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standpoints  of  both  the  buyer  and  the  seller.  Previous  work  by  Glickman  and  Berger  [6)  exam¬ 
ined  the  free  replacement  warranty  from  the  standpoint  of  the  seller.  Menke  [8]  also  discusses 
the  free  replacement  type  warranty,  but  assumes  that  the  time  between  failures  of  the  product 
is  distributed  according  to  an  exponential  distribution.  Bell  [3]  has  examined  the  long  run  aver¬ 
age  profits  to  the  producer  of  warranty.  Beidenweg  [4]  has  also  examined  the  average  cost  asso¬ 
ciated  with  warranties. 

In  this  paper  we  will  formulate  a  model  describing  the  total  cost  of  ownership  of  a  product 
under  either  a  pro  rata  or  free  replacement  type  warranty.  We  will  also  examine  the  short  run 
total  profit  derived  from  selling  a  sequence  of  products  covered  by  either  pro  rata  or  free 
replacement  type  warranties.  The  expressions  obtained  for  the  cost  of  ownership  and  profit 
under  a  pro  rata  warranty  correct  a  mistake  appearing  in  the  paper  by  Blischke  and  Scheuer  [5], 
We  then  derive  formulae  for  the  long  run  average  cost  of  ownership  and  profit  associated  with 
the  pro  rata  and  free  replacement  warranties.  These  expressions  extend  partial  results  obtained 
by  Bell  [31.  Finally,  we  demonstrate  that,  given  two  products  with  equal  expected  lifetimes, 
one  with  a  monotone  increasing  failure  rate,  and  one  with  a  monotone  decreasing  failure  rate, 
the  warranty  costs  and  costs  of  ownership  under  a  pro  rata  type  warranty  will  be  higher  for  the 
increasing  failure  rate  product  than  for  the  decreasing  failure  rate  product. 

2.  SHORT  RUN  TOTAL  COSTS 


2.1  Pro  Rata  Warranty 

Suppose  that  the  cost  to  the  seller  of  replacing  the  product  at  the  time  of  failure  is  con¬ 
stant  and  equal  to  qt.  Suppose  that  the  cost  to  the  owner  of  replacing  the  product  is  q2 ,  where 
q2  >  </|.  The  difference  between  q2  and  q \  represents  the  difference  between  the  producer’s 
and  consumer’s  cost  of  repair  or  replacement  of  the  product.  We  assume  that  the  product  has 
only  two  possible  states  (working  or  failed)  and  that  after  each  failure  it  is  replaced  by  an  ident¬ 
ical  unit.  Thus,  there  is  no  distinction  between  repair  and  replacement  of  the  product.  In  what 
follows  we  shall  speak  only  of  replacement,  but  the  results  could  apply  equally  well  to  the  case 
in  which  the  product  is  repaired  at  constant  cost  after  each  failure. 


Under  a  pro  rata  warranty  of  length  W,  the  owner  of  the  product  will  pay  q2  if  the  product 
fails  after  time  IF,  and  will  pay  q2(X/  tV)  if  it  fails  at  a  time  X  ^  W,  where  X,  a  random  vari¬ 
able,  with  distribution  F \  represents  the  lifetime  of  the  product.  We  assume  that  after  each 
failure  the  customer  replaces  the  product  immediately  with  an  identical  unit  carrying  an  identi¬ 
cal  warranty.  Under  these  conditions  the  sequence  of  breakdowns  is  described  by  a  renewal 
process.  We  can  then  write  a  function  C(A'),  the  cost  of  replacing  a  unit  which  survives  for  a 
period  X  before  failing: 


C(X)  = 


q2  if  X  >  W 
q2(XIW >  if  X  < 


W 


Suppose  that  the  owner  of  the  product  intends  to  replace  it  until  time  T,  after  which, 
jhould  it  fail,  he  will  not  replace  it.  T  is  the  "life  cycle"  or  time  horizon  of  the  product  as 
defined  in  Blischke  and  Scheuer  [5].  Let  R  ( T)  denote  the  total  replacement  costs  incurred 
from  owning  the  product  up  to  time  7",  ignoring  the  initial  purchase  cost.  We  wish  to  compute 
the  total  expected  replacement  costs  up  to  time  7",  namely  E[R  (Dl.  Conditioning  on  the  time 
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of  the  first  breakdown,  3f,,  and  using  the  fact  that  under  a  pro  rata  warranty  the  warranty  is 
renewed  after  each  failure,  we  may  write: 

0  if  x  >  T 

E[R(T)\X[  =  x)  -  q2+E[R(T-x)}  ir  W  <  x  <  T 

q2(x/W)  +  E[R(T-x)]  if  0  <  *  ^  W 

By  the  law  of  total  probability: 

E\R  ( 71]  =  /J  [q2  +  E[R  ( T  -  *))]  dF(x)  4-  [q2 (x/  W) 

4-  E[R(T  -  *))]  dF(x). 

r  w 

LetMK  =J0  xdF(x).  Define: 

hit)  =  J*o  (q2  a  (q2x/W))  dF(x). 

We  can  rewrite  the  above  equation  as: 

£[K(71]  *=  h(T)  4-  fT  ElR(T- x)\  dF(x). 

Jo 

This  is  a  standard  renewal  equation  and  has  solution: 

£[*(711  -  h(T)  +  fj  h(T  —  u)  dm(u). 

Where  in  the  above  expression  m(x)  is  the  renewal  function  associated  with  the  distribution  F. 
The  renewal  function  is  defined  by  the  equation: 

ml x)  =  £  FinHx) 

n- 1 

where  Fln)ix)  represents  the  n-fold  convolution  of  devaluated  at  the  point  x.  We  can  rewrite 
the  expression  for  £[R(F)]  as: 

(1)  ElR(T)]  -  q2\F(T)  -  F(W)  4-  ^JW]  +  q2  F(T  -  x)  dm(x) 

+  fr  [7  '  (q2z/W)  dFiz)  dmix). 

J  T-W  JQ  n 1 

Denote  by  E[TCPR (71]  the  total  costs  of  replacement  during  the  period  [0,71,  then: 

(2)  E[TCPRm]  -  q2+  E[R  ( 711  -  q2  4-  q2[F(T)  -  F(W)  +*JW\ 

+  q2  f  F(T  —  x)  dm(x )  4-  f  f  (q2z/ W)  dF(z)dm(x). 

•JO  T—  W  J  0 

We  can  derive  the  seller’s  profit  during  the  product  life  cycle  by  noting  that  the  consumer's  cost 
is  equal  to  the  producer’s  revenue.  The  producer's  profit  is  then  his  revenue  less  the  to*  •  of 
the  replacements  made  in  the  interval  (0.71.  The  cost  of  the  replacements  is  qtm(T).  Thus, 
the  seller’s  profit  is: 

(3)  £[ttpr(D]  -  (q2  -  4|)  4-  E[R(T)]  -  <7,m(7l. 

Equation  (2)  disagrees  with  the  result  presented  in  the  paper  by  Blischke  and  Scheuer  [S3. 
Using  the  notation  of  this  paper  we  can  write  the  expression  derived  in  [51  for  the  cost  of  repair 
of  a  single  failed  item: 

(4)  (42/ WO  U» ,  +  HOI  -  £<W0)1. 
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Blischke  and  Scheuer  condition  on  the  number  of  failures  during  the  period  [0, 71.  They  then 
take  expectations  and  obtain: 

(5)  q2  +  m(T)  (q-J  W)  U»  ,  +  W(\  -  F(W))\. 

Conditioning  on  the  number  of  failures  in  an  interval  changes  the  distribution  of  time  between 
failures  in  that  interval,  invalidating  equation  (4).  Thus  the  result  of  Blischke  and  Scheuer  for 
the  pro  rata  warranty  is  in  error.  An  expression  similar  to  (1)  has  also  been  obtained  by 
Biedenweg  [41  but  he  does  not  mention  the  flaw  in  the  argument  of  Blischke  and  Scheuer. 

2.2  The  Free-Replacement  Warranty 


The  analysis  of  the  free-replacement  warranty  is  more  complex  than  that  of  the  pro  rata 
warranty.  The  cost  to  the  consumer  of  replacement  is  zero  if  the  product  fails  before  the  end 
of  the  warranty  period,  W.  If  the  product  fails  after  time  W  then  the  consumer  pays  the  full 
price  for  the  replacement.  Any  replacement  made  during  the  warranty  period  is  warranted  only 
until  the  end  of  the  period,  thus  the  warranty  is  not  renewed  after  each  replacement.  When  the 
product  fails  outside  of  the  warranty  period  and  the  consumer  replaces  it  at  full  cost,  then  the 
replacement  item  is  covered  by  a  free  replacement  warranty  identical  to  the  warranty  on  the  ori¬ 
ginal  item  purchased.  This  guarantees  that  the  consumer  will  receive  as  many  free  replace¬ 
ments  as  needed  during  the  warranty  period,  but  must  buy  the  replacements  at  full  cost  after 
the  warranty  period.  Thus,  the  consumer  pays  every  tV  +  e(  W)  units  of  time,  where  e(W)  is 
the  excess  life  of  the  process  Xu  X2.  ...  at  time  W.  If  N(W)  represents  the  number  of 
failures  by  time  W,  and  S„  represents  the  sum  of  the  first  n  lifetimes,  then  the  random  variable 
c(  W7)  is  given  by: 


e(W)  -  S.vtm+i  -  W. 


Let  CFRi  T.  W)  represent  the  expected  cost  of  owning  a  product  with  a  warranty  of  length  IF  for 
a  length  of  time  T  exclusive  of  initial  purchase  cost.  If,  as  before,  we  condition  on  the  time 
until  the  first  failure.  A",,  we  get  the  following  equation  for  E[CFR(T,W)\X\  =  *]: 


E[CFR{T.W)\X]  =  x] 


0 

q2  +  E[Cfr(T-  x,  W)) 
E[CFR(T-  x.  W  -  x)l 


if  x  >  T 
if  W  <  x  T 
if  0  <  x  W 


This  yields  the  following  functional  equation  for  E[CFR(T,  IV)]: 

(6)  E[Cfr(T.W)]  =  q2(F(T)  -  F(W))  +  E[CFR(T  -  x,  W)]  dF(x) 

+  E[CFR(T  -  x,  W-x)]dF(x). 

The  total  expected  cost  of  ownership  during  the  period  [0,rl,  of  a  product  under  a  free-replace- 
ment  warranty.  TCFR(T,W)  is  given  by: 

TCfr(T.W)  =  E[Cfr(T,W)\  +  q2. 

We  can  determine  a  priori  that  TCFR(x.y)  =  q2  if  y  <  x. 

We  can  derive  an  expression  for  the  expected  total  profit  earned  during  the  period  [0,71. 
Let  Pfr)T.W)  represent  the  profit,  exclusive  of  the  profit  on  the  sale  at  time  0,  earned  during 
(O.rl.  As,  before,  by  conditioning  on  the  length  of  the  lifetime  of  the  first  product  we  get  an 
expression  for  E[PFR(  T.  W)\ Xx  =  ,v]: 
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E[PFR(T,W)\Xx  -  xl 


o  if  x  >  r 

q2  -  <fi  4-  E[PFR  (T-  x,  W)\  if  W  <  x  <  T. 
E[PFR(T  -  x,  W  -  x))  -  qx  if  0  <  x  <  W 


The  expected  profit  in  (0,T]  is  given  by: 

(7)  E[Pfr(T,W)\  =  q2lF(T)  -  F{W))  -  qx  [T(  T)] 

+  [TElPFR(T-x,  W)]dF(x)+  f  WElPFR(T~ x,  W-x)\dF{x). 
Expected  total  profit  in  the  period  [0 ,T]  is  given  by: 

E[TPFR(T,W)\  -  E[Pfr(T.W)]  +  q2~qx. 

Equations  (6)  and  (7)  are  difficult  to  solve  in  general.  Blischke  and  Scheuer  compute  the 
expected  profit  only  for  the  case  in  which  T  —  W,  and  approximate  the  solution  to  (6)  by: 

TCfr(T,W)  =  q2(  1  +  my(T))  ~  q2(  1  +  T/(E(X)[  1  +  m(fV)  1) 

where  my(T)  is  defined  to  be  the  renewal  function  of  the  random  variable  Y  =  W  4-  e(  W). 
They  justify  this  approximation  by  observing  that  as  T  — *  mr(T)/T  —  l/ffA'jll  +  m(W)\. 

3.  LONG  RUN  AVERAGE  COST 


In  many  situations  the  long  run  averages  of  cost  and  profit  are  better  criteria  for  compar¬ 
ing  alternative  policies.  This  is  particularly  true  in  situations  in  which  the  product  life-cycle  is 
expected  to  be  quite  long.  In  this  section  we  will  derive  exact  formulas  for  the  long  run  aver¬ 
age  cost  and  profit  form  a  product  under  three  possible  conditions,  a  no  warranty,  a  pro  rala 
warranty,  and  a  free  replacement  warranty.  We  will  make  use  of  the  following  result  which  is 
proved  in  Johns  and  Miller  [7]  and  Ross  (91. 


Suppose  that  the  sequence  of  pairs  of  random  variables  O',,  X, ),  f—  1,  ...  form  an 
independent  and  identically  distributed  sequence,  then: 

NKT) 

1)  !imr_„  1/T  £  y,  “  E{Y,)/E(Xt)  with  probability  1 


and: 


2)  limr_„ 


(1/T)£ 


N(D 

I 


i-l 


Yx 


E(Yx)/E(Xi). 


In  the  above  equations  ZV(f)  is  the  random  variable  representing  the  number  of  renewals 
of  the  process  (30,  /-l,  ...  in  the  interval  (0,71.  The  above  equations  imply  that  the 

NiT) 

almost  sure  limit  and  the  limit  of  the  expected  value  of  (1/T)  £  T,  are  the  same.  In  what 

/— 1 

follows  we  shall  use  "average"  to  mean  either  of  these  quantities;  since  they  are  the  same  no 
ambiguity  will  arise. 

3.1  The  No  Warranty  Case 

In  the  case  in  which  the  sequence  of  products  purchased  by  the  consumer  is  not  covered 
by  a  warranty  the  analysis  of  the  long  run  average  cost  is  quite  simple.  The  consumer  pays  an 
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amount  q2  after  every  failure.  Since  the  sequence  of  product  lifetimes,  X\.  X2,  ...  was 
assumed  to  be  a  sequence  of  independent  and  identically  distributed  random  variables;  iq2,  X,) 
comprise  a  renewal  reward  process,  and  the  average  cost  of  ownership,  ACswi  >s  given  by: 

ACnw  “  q-JEiX). 

Similarly  the  seller  receives  a  profit  of  q2  ■  q\  every  X:  units  of  time  yielding  and  average 
profit,  APnw  of: 

APNw  “  iQi  ~  q\)l  EiX) . 

Since  the  cost,  q2 ,  and  profit,  q2  -  q\ ,  of  the  initial  purchase  are  constant,  and  finite,  they 
do  not  change  the  long  run  average  cost  and  profit.  Hence,  we  may  ignore  them. 

3.2  The  Pro  Rata  Warranty 

The  long  run  average  cost  of  ownership  of  a  product  under  a  pro  rata  warranty  may  be 
computed  from  equation  (3).  An  easier  approach  is  to  note  that  the  cost  of  replacement  of  a 
product  under  a  pro  rata  warranty  is  independent  of  prior  and  future  replacement  costs.  As  in 
Section  2.1  denote  the  cost  of  the  /1h  product  purchased  by  CiX,).  (CiX,),  X,)  comprise  a 
renewal  reward  process.  The  long  run  average  cost  of  replacement  is  given  by: 

N(T) 

(8)  lim  1/T  T  CU;)  -  EtCW, )]/£(*,)  -  (q2)  1m a  +  W(1  -  F(W))]/WE,  (X). 


We  can  develop  a  more  convenient  expression  for  the  average  cost  of  ownership  under  a 
pro  rata  warranty  in  terms  of  the  cumulative  hazard  function  of  the  distribution  cf  the  lifetime 
of  the  product.  Observe  that  since  F( 0)  -  0: 

Mw“  f"  xdF(x)  -J"  (1  -  F(x))dx  -  (1  -  F(W))W. 

If  we  substitute  this  formula  into  equation  (8)  we  get,  after  simplifying: 

ACpr(W)  -  (q2JW)  f  W  (1  -  Fix))  dx/E(X). 

0 


Assume  that  F  has  a  density  and  denote  the  failure  rate  of  F  by  h(x).  Let  H(x ) 
represent  the  cumulative  failure  rate  up  to  time  x  The  failure  rate,  h(x)  is  given  by: 

h(x)  ~f{x)f\\  -  E(x)]. 

F  is  said  to  have  the  increasing  failure  rate  property  (F  is  IFR)  if  h(x)  is  increasing  and  the 
decreasing  failure  rate  property  if  h(x)  is  decreasing  (Eis  DFR).  H(.x )  is  given  by: 

H(x)  -  f  h(u)du. 

Jo 

We  can  now  make  use  of  the  following  formula  for  1  -  Fix)  (see  Barlow  and  Proschan 

111): 

1  -  Fix)  -  e~Hix)dx. 


naval  research  logistics  quarterly 


J 


COST  ANALYSIS  OF  PRO  RATA  AND  FREE-REPLACEMENT  WARRANTIES 


351 


We  can  write  the  average  cost  of  ownership  of  a  product  under  a  pro  rata  warranty  as: 

(9)  ACpr  ( W)  -  (qJW)  e-H{x)  dx/E(X). 

•'O 

A  similar  calculation  yields  an  expression  for  the  average  profit  to  the  seller  of  a  product 
under  a  pro  rata  warranty  of  length  W.  Denote  the  average  profit  by  APPR(  W)\ 


APpr(W)  =  UqJW)  f"  e-"M  dx  -  qJ/EiX). 


Note  that  the  average  cost  and  profit  depend  on  both  the  cumulative  hazard  function  and 
on  the  mean  of  the  distribution  of  product  lifetime.  We  can  illustrate  this  dependence  in  the 
following: 

PROPOSITION:  Consider  two  different  products  with  lifetimes  described  by  the  random 
variables  X\  and  X2  with  common  expectation  fi.  Let  F\  be  the  distribution  of  Xt  and  F 2  the 
distribution  of  X2.  Suppose  that  the  replacement  cost  to  the  consumer  is  identical  for  each  pro¬ 
duct.  If  F 1  has  a  continuous  increasing  failure  rate  and  F2  has  a  continuous  decreasing  failure 
rate  then: 

AC'pr(W)  >  ACpr  (  HO. 

Where  ACPR(  W)  is  the  long  run  average  cost  of  ownership  of  product  1  and  AC$R(  W)  is 
the  longrun  cost  of  ownership  of  product  2. 

PROOF:  Since  ACPR(  W)  -  AC}R(W)  is  given  by: 

(«*: Wtf.’ 

It  is  sufficient  to  show  that,  for  all  w. 

r”  -w,u>  .  s  f  -h,(x)  , 

Jo  *  ^  >  Jo  *  dx- 

From  Barlow  and  Proschan  [1]  p.  113,  we  know  that,  if  F\  is  IFR  and  F2  is  DFR,  then  for 
any  1  <  ^  : 

- HAi )  - 

e  1  ^  e  1 

Hence  for  any  t  <  (i: 

[/;  >  0 

Now  suppose  that  at  some  point  z  >  /u, 

|/;  -  e *j  <  0. 

Since  e  H'lx)  -  e  Hlix)  is  continuous  there  must  be  some  interval  beyond  p  on  which  it 
is  negative. 
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But  by  assumption: 


Jo 


- H,(x ) 

e  1 


dx. 


So  there  must  exist  and  interval  beyond  z  on  which  e  H'(x)  -  e  Hl{x)  is  positive.  These 

_ //(x)  —  Mix) 

two  observations,  combined  with  the  assumption  of  the  continuity  of  e  1  —  e  2  estab¬ 

lish  that  there  must  exist  x(  and  x2,  n  <  xx  <  x2  <  00  such  that: 


—  e 


-W2<  Jt,) 


e  1  1  —  e  1 


0 

0. 


Thus: 

//,(*,)  -  //2(x,)  -  0 
//,(jc2)  -  H2(x2)  -  0. 

Since  it  is  also  true  that  //,( 0)  -  H2( 0)  —  0,  we  can  apply  the  mean  value  theorem  to 
H i  Cjc)  -  H2(x)  between  0  and  xt  and  between  X\  and  x2  to  find  two  points  r,  and 
r2,  r,  <  X!  <  r2  <  °°  such  that: 

H{(r0  -  Hi(rx)  -  -  h2(rx)  -  0 

H[(r2)  ~  H2(r2)  -  hx(r2)  -  h2(r2)  -  0. 


But  the  last  two  equalities  indicate  that  h\  and  h2  cross  twice,  contradicting  the  assump¬ 
tion  of  monotone  failure  rates.  This  establishes  the  proposition.  If  we  assume  that  the  failure 
rates  are  only  monotone,  rather  than  monotone  and  continuous,  h,(x )  will  be  continuous 
except  at  a  set  of  points  of  Lesbegue  measure  0.  Hence,  H,(x)  will  be  differentiable  every¬ 
where  except  possibly  at  a  set  of  points  of  measure  0.  The  proof  can  be  generalized  to  include 
this  case.  A  similar  proposition  may  be  established  for  the  average  profit. 

3.3  The  Free-Replacement  Warranty 

As  before,  let  e(  IV)  denote  the  excess  life  of  the  renewal  process  AV  X2,  ...  at  time  W. 
Linder  a  free  replacement  warranty  of  length  W  the  buyer  need  only  pay  for  a  replacement  after 
every  w  +  ei  W)  units  of  time.  At  these  points  in  time  the  sequence  of  payments  regenerates. 
Define  Tt,  K2,  to  be  a  sequence  of  random  variables  which  are  independent  with  distributions 
identical  to  the  distribution  of  W  +  e(IV).  Y,  represents  the  time  between  the  /Hh  and  /  +  l5' 
time  that  the  customer  must  pay  for  a  replacement.  We  shall  call  Y,  the  /*h  ’warranty  cycle." 
The  sequence  (Y,,  q2)  forms  a  renewal  reward  process.  Hence,  the  average  cost  of  ownership 
of  the  product  under  a  free  replacement  warranty  is: 

(11)  ACFK(m  -  qJEiY)  -  ?2/(m 0*0  +  1)  EiX). 


Again,  mOf')  is  the  renewal  function  of  the  common  distribution,  F of  the  sequence  Xt. 
As  in  the  case  in  which  there  is  no  warranty,  the  initial  cost  has  no  effect  on  the  long-run  aver¬ 
age  cost. 
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The  seller  of  the  product  replaces  the  product  for  free  whenever  it  fails  during  the  war¬ 
ranty  period,  incurring  a  cost  q\.  At  the  first  time  it  fails  outside  of  the  warranty  period  the 
seller  nets  gj  -  q j.  Let  Q(  Y)  represent  the  net  revenue  received  from  the  sale  of  replace¬ 
ments  during  a  single  warranty  cycle.  The  sequence  ( Yjt  Q(  Y,))  forms  a  renewal  reward  pro¬ 
cess.  We  can  find  ElQ(Y,)]  by  solving  equation  (7)  for  the  case  in  which  T  -  W.  It  is 
simpler,  however,  to  note  that  the  expected  profit  in  any  warranty  period  is  equal  to  the 
difference  between  the  expected  cost  of  replacements  during  the  warranty  period,  and  the  profit 
from  the  first  replacement  after  the  warranty  period  is  given  by: 

£1000]  -  (fl2-  <?,)  -  (?,m(W). 


The  long  run  average  profit  is  given  by: 

(12)  APfr(W)  -  £(0OO1/£OO  -  <f,)  -  m(W)\Km(W)  +  1)  E{X). 

If  we  now  multiply  the  long  run  average  cost  of  ownership  under  warranty  by  the  length  of  the 
life  cycle  T,  we  will  have  an  estimate  of  the  short  run  total  cost  up  to  time  7! 

TCfr(T,\V)  ~  W)  +  1)  £(*)]. 


This  formula  is  similar  to  the  estimate  used  by  Blischke  and  Scheuer.  It  is  clear  from  our 
discussion  and  from  the  theory  of  renewal  reward  processes  that  (see  Johns  and  Miller  17]): 

[TCFR(T.W)  -  +  1)  £(T)])/r-0as  7"  -  °o. 

4.  A  NUMERICAL  EXAMPLE 

In  this  section  we  will  illustrate  the  results  of  the  previous  section  for  three  different  pro¬ 
duct  lifetime  distributions.  We  will  consider  the  gamma,  exponential,  and  hyperexponential 
distributions.  The  gamma  density  is  given  by: 

/„.*(*)  -  A°/lr(a)]jr0-'  e~Kxx  >  0,  a  >  0.  A  >  0. 

In  the  above  equation  T(«)  is  the  gamma  function.  The  gamma  distribution  has  mean  a/A. 
For  a  >  1  it  has  a  monotone  increasing  failure  rate  and  for  «  <  1  it  has  a  montone  decreasing 
failure  rate  (see  Barlow  and  Proschan  [ I ]  p.  75).  The  exponential  density  is  given  by: 

ft(x)  -  6e-txx  >  0,  8  >  0. 

The  mean  of  the  exponential  distribution  is  1/5.  The  exponential  distribution  has  a  constant 
failure  rate.  The  hyperexponential  density  is  given  by: 

-  YP\e~0'*  +  (1  ~ 

The  hyperexponential  distribution  has  mean  -y  ( 1//3))  +  (1  -  y)  (l/j32)  and  a  decreasing  failure 
rate.  Tables  1  and  2  display  the  long  run  average  cost  to  the  consumer  and  profit  to  the  pro¬ 
ducer  for  pro  rata  and  free  replacement  warranties  of  various  lengths.  For  the  gamma  distribu¬ 
tion  the  parameters  were  taken  to  be  a  -  1.5.  A  »  2,  for  the  exponential,  8  -  1.333  and  for  the 
hyperexponential,  y  —  .5,  0]  —  8  and  /S2  -  .7273.  In  each  case  this  yields  a  mean  life  of  .75 
(we  take  the  time  unit  to  be  years).  Thus,  we  have  three  different  product  lifetime  distribu¬ 
tions,  with  the  same  mean,  with  increasing  (gamma),  constant  (exponential),  and  decreasing 
(hyperexponential)  failure  rates.  For  the  cases  described  in  Table  1  and  2  the  producer’s  and 
consumer's  costs  of  repair  were  set  equal  to  $1.  Thus,  the  entries  for  the  producer’s  profit  in 
the  tables  are  negative. 
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The  results  of  Table  1  are  consistent  with  the  behavior  predicted  by  the  proposition  of 
Section  3.  The  entries  in  the  table  were  computed  using  formulae  (9)  and  (10).  For  each  war¬ 
ranty  length  the  cost  to  the  consumer  is  higher  for  the  gamma  distribution  than  for  either  the 
exponential  or  hyperexponential  distributions.  The  difference  in  cost  to  the  consumer  rises 
quite  sharply  for  relatively  short  warranty  lengths.  For  a  warranty  length  of  .6  years  the  long 
run  average  cost  to  the  consumer  for  a  product  with  the  gamma  distributed  lifetime  is  48% 
higher  than  for  a  product  with  the  hyperexponentially  distributed  lifetime.  The  difference 
decreases  as  the  length  of  the  warranty  increases  until,  for  a  warranty  length  of  four  years,  the 
average  cost  to  the  consumer  is  only  5%  higher  for  the  gamma  distributed  lifetime.  This 
behavior  is  intuitive  as  the  average  cost  to  the  consumer  from  a  pro  rata  warranty  approaches  0 
as  the  warranty  length  increases,  and  1  /E(X)  as  the  warranty  length  approaches  0,  regardless  of 
the  product  lifetime  distribution. 

From  the  proposition  of  Section  3  and  Table  1  we  can  see  that  the  common  practice  of 
estimating  a  product  lifetime  distribution  with  known  mean  by  an  exponential  distribution  with 
the  same  mean  may  lead  to  serious  under  or  over  estimates  of  average  cost  and  profit  under 
warranty.  This  result  is  particularly  important  in  the  case  in  which  the  product  lifetime  distribu¬ 
tion  has  a  decreasing  failure  rate.  Decreasing  failure  rate  distributions  are  often  associated  with 
products  which  suffer  from  'infant  mortality,"  i.e.,  a  small  proportion  of  products  fail  early. 
This  is  precisely  the  situation  in  which  a  warranty  policy  may  be  useful.  However,  the  use  of 
an  exponential  distribution  to  approximate  the  product  lifetime  distribution  will  yield  an  over 
estimate  of  long  run  average  costs  of  ownership  under  warranty.  For  the  case  illustrated  in 
Table  1,  at  a  warranty  length  of  .6  years  the  exponential  distribution  yields  a  long  run  average 
cost  of  ownership  which  is  35%  higher  than  for  the  hyperexponential  distribution.  For  a  length 
of  4  years  the  difference  is  5%. 

The  calculation  of  the  costs  and  benefits  of  the  free  replacement  warranty  is  somewhat 
more  difficult,  as  the  formulae  for  ACFR (W)  and  APFR(W)  (equations  (11)  and  (12))  involve 
the  renewal  function  explicitly.  Fortunately,  it  may  be  calculated  easily  for  the  exponential  and 
hyperexponential  distributions.  For  the  gamma  distribution  the  renewal  function  has  been 
tabulated.  The  tables  of  Baxter,  Scheuer,  Blischke  and  McConalogue  [2]  were  used  in  the 
preparation  of  Table  2. 

Table  2  shows  much  the  same  pattern  as  Table  1.  The  difference  is  long  run  average  cost 
between  gamma  and  hyperexponential  distributions  does  not  fall  as  quickly  as  it  does  for  the 
pro  rata  warranty.  From  the  results  of  Tables  1  and  2  we  can  find  the  warranty  lengths  for  the 
pro  rata  and  free-replacement  warranties  which  yield  equal  long  run  average  costs  of  ownership. 
We  can  see  that  the  pro  rata  warranty  required  to  reduce  costs  to  a  given  level  is  longer  than 
the  free  replacement  warranty  needed  to  reduce  average  costs  to  the  same  level. 

5.  CONCLUSION 

In  this  paper  we  have  derived  expressions  for  the  short  run  total  cost  of  ownership  and 
short  run  total  profit  from  the  sale  of  a  sequence  of  products  under  a  pro  rata  warranty.  Equa¬ 
tion  (2)  corrects  a  mistake  in  the  literature.  We  develop  an  integral  equation  for  the  short  run 
total  cost  and  profit  under  a  free  replacement  warranty.  Throughout  our  derivation  we  have 
taken  the  approach  of  computing  the  total  cost  of  ownership  exclusive  of  initial  purchase  cost, 
and  then  adding  in  the  initial  purchase  cost.  This  approach  has  the  advantage  of  allowing  us  to 
treat,  with  minor  changes  in  notation,  the  situation  in  which  the  cost  of  the  first  item  sold  is 
different  from  the  cost  of  subsequent  items. 
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TABLE  1  —  Long  Run  Average  Cost-Pro  Rata  Warranty 


Gamma 

Exponential 

Hyperexponential 

Warranty 

Consumer 

Producer 

Consumer 

Producer 

Consumer 

Producer 

Length 

Cost 

Profit 

Cost 

Profit 

Cost 

Profit 

(year) 

($) 

(S) 

(S) 

($) 

($) 

(S) 

.2 

1.248 

-.086 

1.170 

-.163 

.953 

-.380 

.4 

1.126 

-.207 

1.033 

-.300 

.778 

-.555 

.6 

1.005 

-.328 

.918 

-.416 

.678 

-.655 

.8 

.895 

-.437 

.820 

-.514 

.609 

-.724 

1.0 

.799 

-.534 

.736 

-.597 

.557 

-.776 

1.2 

.715 

-.618 

.665 

-.668 

.514 

-.819 

1.4 

.643 

-.690 

.604 

-.729 

.478 

-.856 

1.6 

.582 

-.752 

.551 

-.782 

.446 

-.887 

1.8 

.529 

-.805 

.505 

-.828 

.418 

-.915 

2.0 

.483 

-.850 

.465 

-.868 

.393 

-.940 

2.2 

.444 

-.889 

.430 

-  903 

.370 

-963 

2.4 

.410 

-.923 

.400 

-.934 

.350 

-.983 

2.6 

.380 

-  953 

.373 

-.961 

.331 

-1.002 

2.8 

.354 

-.979 

.349 

-.985 

.314 

-1.019 

3.0 

.331 

-1.002 

.327 

-1.006 

.299 

-1.034 

3.2 

.311 

-1.022 

.308 

-1.025 

.285 

-1.049 

3.4 

.293 

-1.050 

.291 

-1.042 

.271 

-1.062 

3.6 

.277 

-1.056 

.275 

-1.058 

.259 

-1.074 

3.8 

.263 

-1.071 

.261 

-1.085 

.248 

-1.085 

4.0 

.250 

-1.084 

.249 

-1.072 

.238 

-1.096 

TABLE  2  —  Long  Run  Average  Cost-Free  Replacement  Warranty 


Warranty 

Length 

(year) 

Gamma 

Consumer  Producer 
Cost  Profit 

(S)  (S) 

Exponential 
Consumer  Producer 
Cost  Profit 

($)  (S) 

Hyperexponential 
Consumer  Producer 
Cost  Profit 

($)  ($) 

.2 

1.151 

-.183 

1  053 

-.281 

.798 

-.535 

.4 

.958 

-.375 

.870 

-.464 

.633 

-.700 

.6 

.811 

-.523 

.741 

-.593 

.546 

-.788 

.8 

.684 

-.650 

.645 

-.688 

.487 

-.847 

1.0 

.615 

-.719 

.571 

-.762 

.442 

-.892 

1.2 

.548 

-.786 

.513 

-.820 

.405 

-.928 

1.4 

.494 

-.840 

.465 

-.868 

.375 

-.959 

1.6 

.450 

-.884 

.426 

-.908 

.348 

-.985 

1.8 

.412 

-  921 

.392 

-.941 

.326 

-1.008 

2.0 

.381 

-  952 

.364 

-.970 

.306 

-1.028 

2.2 

.354 

-.979 

.339 

-.994 

.288 

—  I  045 

2.4 

.331 

-1  003 

.317 

-1.016 

.272 

— 1.061 

2.6 

310 

-1  023 

.299 

-1.035 

.258 

-1.075 

2.8 

.292 

-1.041 

.282 

- 1.052 

.246 

-1.088 

3.0 

276 

-1.057 

.267 

-1.067 

.234 

-1.099 

3.2 

.261 

- 1.072 

.253 

- 1  080 

.224 

-1.110 

3.4 

248 

-1  085 

.241 

-1.092 

.214 

— 1.1 19 

3.6 

.237 

-1.097 

.230 

-1.103 

.205 

-1.128 

3  8 

225 

-1  108 

.220 

-1.114 

.197 

-1.136 

4.0 

.216 

-1.117 

.211 

-1.123 

.190 

-1.144 
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We  also  derive  formulae  for  the  long  run  average  cost  and  profit  in  the  case  of  the  pro 
rata  and  free  replacement  warranties.  It  is  possible  to  get  exact,  closed  form  expressions  for 
average  cost  and  profit  for  both  the  pro  rata  and  free  replacement  warranties,  a  definite 
improvement  over  the  short  run  case.  The  formulae  for  A  Cm*.  ACpr ,  and  ACfp  can  be  used 
to  compare  the  costs  to  the  buyer  of  no  warranty,  a  pro  rata  warranty  and  a  free  replacement 
warranty.  Similarly  APNi f,  APpr ,  and  APFR  can  be  used  to  compare  the  profitability  of  the 
various  types  of  warranty  policies.  Our  approach  to  average  profits  is  similar  to  that  of  Bell  (3). 
The  expressions  also  allowed  us  to  see  explicitly  in  the  case  of  the  pro  rata  warranty  the  rela¬ 
tionship  between  hazard  rate  and  the  cost  of  ownership  under  warranty. 
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ABSTRACT 

Shipbuilding  as  currently  practiced  in  U  S  commercial  shipyards  employs  lit¬ 
tle  quantitative  modeling  or  analysis  in  production  planning  This  paper 
presents  a  brief  discussion  of  the  shipbuilding  process  and  focuses  on  one  major 
component  which  is  referred  to  as  outfitting  The  outfit  planning  problem  is 
described  in  detail  and  then  foimally  modeled  as  a  generalization  of  the 
resource  constrained  protect  scheduling  problem  The  value  of  the  approach  as 
well  as  barriers  to  its  adoption  are  also  discussed 


1.  INTRODUCTION 

For  a  number  of  years,  the  U  S.  shipbuilding  industry  as  a  whole  has  been  recognized  as 
not  competitive  with  the  best  foreign  shipbuilders.  Because  shipbuilding  is  considered  a  vital 
industry,  commercial  shipyards  at  one  time  received  a  substantial  subsidy,  which  was  admin¬ 
istered  by  the  Maritime  Administration  (MarAd)  of  the  Department  of  Commerce.  In  addi¬ 
tion,  MarAd  supports  various  research  projects  through  the  Shipbuilding  Research  Program, 
which  are  aimed  at  improving  shipbuilding  in  U  S.  shipyards. 

In  the  past.  Mar  Ad’s  research  emphasis  has  been  technologically  or  design  oriented,  eg., 
welding  technology,  cutting  technology,  propeller  design,  hull  form  design  and  so  forth.  More 
recently,  there  has  been  a  growing  realization  that  important  problems  also  exist  in  production 
methodology,  including  work  methods,  production  standards,  production  planning  and  produc¬ 
tion  control.  This  paper  addresses  a  particular  problem  in  production  planning  and  control  in 
shipyards,  and  presents  some  of  the  findings  of  a  MarAd  sponsored  research  project. 

The  problem  considered  in  this  paper  is  called  the  outfit  planning  problem.  Outfitting  refers 
to  the  fabrication  and  installation  in  a  ship  of  everything  that  is  not  considered  hull  steel,  i.e.. 


‘This  research  was  supported  by  contrails  DO-A0I-7H-3074  and  MA79SAC00067  with  the  Maritime  Administration. 
Department  of  Commerce,  and  Contract  NOOO)4-8f)-K -07(19  with  the  Office  of  Naval  Research 
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everything  except  the  hull  itself,  the  decks,  and  the  bulkheads.  In  many  instances,  outfitting 
represents  as  much  as  50%  of  the  cost  of  the  ship  and  also  as  much  as  50%  of  the  elapsed  time 
for  production.  Clearly,  any  improvement  in  the  outfitting  process  would  have  a  significant 
impact  on  the  shipbuilding  industry. 

The  primary  contributions  of  this  paper  are: 

a.  To  describe  this  important  class  of  production  planning  problems; 

b.  To  present  a  formal  model  of  the  problem; 

c.  To  examine  the  impact  of  shipyard’s  adopting  the  solution  approach. 

1.1  The  Shipbuilding  Process* 

Shipyard  production  activities  can  be  broken  into  two  distinct  groups:  the  steel  phase  activi¬ 
ties ,  which  encompass  all  the  activities  associated  with  fabricating  and  assembling  the  hull,  up  to 
and  including  complete  ship  erection;  and  the  outfit  phase  activities,  which  are  associated  with 
the  acquisition,  fabrication  and  installation  in  the  ship  of  everything  else. 

Conceptually  at  least,  the  outfit  phase  activities  could  all  come  after  the  steel  phase  activi¬ 
ties  were  completed.  As  a  practical  matter,  of  course,  this  would  not  be  a  feasible  production 
method  because  of  the  expense  of  opening  up  closed  compartments  to  land  equipment  or  to 
install  outfit  material  such  as  piping.  This  extreme  oversimplification  does,  however,  capture 
much  of  the  traditional  concept  of  outfitting  as  a  'successor  function"  (6).  That  is,  production 
often  has  been  treated  as  two  distinct  phases  with  very  little  interfacing  of  the  steel  and  outfit 
activities. 

Although  various  production  methods  at.'  .iracticed  in  U.S.  shipyards,  the  most  common 
method  is  hull  block  construction.  The  ship  is  divided  geographically  into  components,  or  blocks. 
A  typical  ship  might  consist  of  100  blocks.  The  steel  components  of  the  block  are  assembled  in 
a  block  assembly  area,  and  the  block,  perhaps  weighing  as  much  as  several  hundred  tons,  is 
then  lifted  onto  the  ways  for  final  erection. 

Shipbuilding  can  be  viewed  in  terms  of  the  material  flows  and  primary  production  facili¬ 
ties.  As  shown  in  Figure  1,  ship  production  occurs  in  three  primary  facilities  with  two  major 
categories  of  supporting  facilities  plus  outside  vendors.  The  steel  shops  represent  facilities 
where  the  steel  forming  activities  take  place.  This  includes  welding  of  stiffeners  and  bracing  to 
large  steel  plates.  Similarly  'other  shops*  include  all  the  facilities  associated  with  fabricating 
sheet  metal,  ducting,  wire,  piping,  equipment,  etc. 

In  this  model,  the  'assembly  area’  represents  any  configuration  of  facilities  where  steel 
and/or  other  materials  are  brought  together  and  processed  prior  to  actual  ship  erection,  i.e., 
prior  to  going  on  the  ways.  The  ways  area  is  the  facility  where  ship  erection,  i.e.,  hull  assem¬ 
bly,  takes  place.  After  hull  assembly,  the  ship  is  completed  at  least  to  the  point  of  being  able  to 
float.  The  ’outfit  pier*  represents  the  stage  of  ship  production  which  follows  float-off. 


'Reference  13)  provides  •  good  summary  of  current  practices  in  U.S.  shipyards. 
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Components,  Materials 

Major  Assemblies,  Completed  Hull 
Figure  l.  General  production  flow  model 
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The  consideration  of  facilities  and  material  flows  leads  naturally  to  the  idea  of  different 
production  modes:. 


(1)  fabrication 

(2)  assembly: 

(3)  erection: 

(4)  outfitting: 


the  production  of  individual  pieces  of  steel,  sheet  metal,  ducting,  pip¬ 
ing,  electrical  cable,  etc. 

fabricated  plates  are  assembly  to  form  blocks;  also  individual  components 
may  be  assembly  to  form  units,  e  g.,  equipment  with  foundations,  valve 
with  piping,  etc. 

the  activity  on  the  ways  that  results  in  the  completed  hull, 
the  remaining  production  activities  that  take  place  once  erection  is  com¬ 
pleted. 


Steel  phase  activities  cannot  be  performed  after  erection,  by  definition.  Outfitting  phase  activi¬ 
ties,  however,  can  be  performed  in  any  of  the  four  modes.  If  they  occur  before  erection,  they 
are  preoutjit  activities. 

Note  that  the  terms  "steel  phase"  and  "outfitting  phase”  have  been  used  to  delineate  activi¬ 
ties  by  type.  The  four  modes  defined  above,  however,  delineate  activities  by  the  timing  of  per¬ 
formance  and  facilities  required.  The  distinction  is  an  important  one  since  there  are  options  for 
many  activities  with  regard  to  production  mode. 

There  are  two  important  aspects  to  production  planning  as  generally  practiced  in  hull  block 
construction.  The  first  is  the  work  breakdown  structure,  which  determines  the  definition  of  the 
work  packages,  i.e.,  the  drawings,  specifications,  operations  sheets,  work  sequences,  and 
material  lists  defining  what  to  do  and  how  to  do  it. 

The  traditional  approach  to  defining  outfitting  work  packages  is  systems  oriented,  that  is, 
the  various  ship  systems  (ventilation,  electrical,  communication,  hydraulic,  etc.)  are  considered 
separately.  While  this  orientation  follows  naturally  from  the  design  phase,  and  simplifies  the 
collection  of  production  cost  data  by  rystem,  it  is  not  the  best  orientation  in  terms  of  produc¬ 
tion  scheduling  and  control.  Only  recently  has  a  product  oriented  work  breakdown  structure 
(PWBS)  been  proposed  (9)  for  U  S.  shipyards.  In  essence,  PWBS  would  yield  work  packages 
describing  all  outfit  work  required  for  a  particular  area  in  a  hull  block. 

The  second  important  aspect  is  the  structure  of  the  scheduling  process.  Figure  2  illustrates 
the  relationships  between  major  planning/scheduling  documents  and  demonstrates  both  the 
hierarchical  structure  of  the  process  and  the  bifurcation  into  separate  steel  and  outfit  schedules. 
The  dashed  lines  in  Figure  2  indicate  that  the  System  Schedule  is  developed  subsequent  to  and 
is  constrained  by  the  Hull  Block  Erection  Schedule  and  the  more  detailed  Block  Assembly 
Schedule.  This  dependence  is  just  one  result  of  the  traditional  treatment  of  outfitting  as  a  suc¬ 
cessor  function. 

2.  THE  OUTFIT  PLANNING  PROBLEM 

Current  practice  in  planning  and  scheduling  ship  production  inherently  limits  the  ability  to 
integrate  steel  and  outfit  activities.  It  results  in  the  bulk  of  outfitting  work  being  performed  in 
the  erected  hull,  either  on  the  ways,  after  a  block  is  closed  in,  or  at  the  wet  dock  or  outfit  pier. 
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Working  conditions  in  the  hull  are  not  ideal  because  of  factors  such  as  difficult  access,  limited 
space  in  which  to  work,  difficulties  in  adequately  venting  noxious  fumes,  and  difficult  work 
positions  (e.g.,  overhead  welding).  The  workplace  is  typically  congested,  with  high  material  flow 
costs,  and  often  hazardous  conditions. 

It  is  now  widely  recognized  that  many  of  these  problems  can  be  relieved  to  some  degree 
by  doing  more  outfitting  activities  earlier  in  the  production  process,  i.e.,  either  in  the  assembly 
area  or  in  the  shop  (vendor)  area.  To  do  this,  however,  requires  a  much  greater  integration  of 
steel  and  outfit  planning  than  has  been  the  rule. 

The  fundamental  problem  is  to  identify  economically  desirable  opportunities  for 
preoutfitting,  this  requires  answering  two  types  of  questions.  The  first  is  related  to  feasibility, 
i.e.,  "Is  there  sufficient  time  and  resource  available  to  do  a  particular  outfitting  activity  in  the 
assembly  or  shop  area,  and  is  it  technically  feasible?"  The  second  question  is  one  of  economics 
"Is  it  more  economical  to  preoutfit  this  activity?"  What  is  needed  is  a  systematic  way  to  answer 
these  questions. 

Any  such  evaluation  procedure  must  have  two  essential  components.  The  first  is  a  flexi¬ 
ble  work  breakdown  structure  which  will  identify  outfit  activities  with  their  geographical 
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location  in  the  ship.  The  second  component  is  a  methodology  for  selecting  the  appropriate  work 
breakdown  and  determining  a  feasible  schedule. 

The  first  component  relates  chiefly  to  work  methods,  while  the  second  relates  chiefly  to 
scheduling.  The  required  work  breakdown  structure  has  been  used  for  many  years  in  the  most 
competitive  foreign  yards  and  is  described  in  detail  in  [8]  and  (9],  It  is  summarized  in  the  fol¬ 
lowing  paragraphs.  The  methodology  for  selecting  a  particular  production  plan  and  developing  a 
feasible  schedule  has  not  yet  been  developed  from  a  quantitative  point  of  view.  Section  3 
presents  a  model  of  the  associated  decision  problem,  and  Appendix  A  contains  a  mathematical 
statement  of  the  model. 

2.1  Product  Oriented  Work  Breakdown  and  Zone  Outfitting 

The  PWBS  divides  the  shipbuilding  process  into  three  basic  types  of  work,  hull  construc¬ 
tion,  outfitting,  and  painting,  and  further  classifies  each  type  of  work  as  fabrication  or  assembly. 
Interim  products  are  classified  by  resource  requirements  and  certain  product  features  such  as 
type  of  system  (e.g.,  lighting  system)  and  zone  (any  geographical  division  of  the  ship).  It  is 
noted  that  PWBS  bears  a  close  resemblance  to  group  technology.  It  is  quite  flexible  and  allows 
activities  to  be  summarized  in  many  different  ways. 

Zone  outfitting  is  to  outfit  activities  what  hull  block  construction  is  to  steel  activities,  i.e., 
it  is  a  logical  method  for  organizing  the  work  to  improve  planning  and  productivity.  Zone 
outfitting  incorporates  three  stages  for  outfitting:  on-unit,  on-block,  and  on-board. 

Outfitting  on-unit  refers  to  the  assembly  of  an  interim  product  consisting  of  only  outfit 
materials.  Examples  are  water  distilling  unit,  fuel  oil  purifier  unit,  pipe  passage  unit,  pump 
room  flat  unit,  etc.  Outfitting  on-unit  impacts  the  shop-related  resources  and  the  material  han¬ 
dling  facilities.  It  may  require  additional  labor  and  materials  for  structural  support  to  units  to 
permit  their  movement  to  the  assembly  or  ways  areas.  It  also  has  some  impact  on  hull  contruc- 
tion  progress  since  the  unit  must  be  landed.  However,  "on-unit  outfitting  should  be  given  the 
highest  priority  .  .  .  because  assembly  is  performed  in  shops  which  provide  ideal  climate,  light¬ 
ing,  and  access"  [8]. 

Outfitting  on-block  refers  to  the  installation  of  outfit  components,  or  units,  in  a  hull  block 
in  the  assembly  area  prior  to  its  erection  on  the  ways.  Outfitting  on-block  is  more  difficult  than 
outfitting  on-unit  because  it  requires  careful  coordination  between  the  steel  activities  and  the 
outfit  activities  (recall  that  there  are  usually  two  distinct  planning  and  scheduling  functions)  and 
may  impact  the  duration  of  a  block’s  occupation  of  an  assembly  area. 

Outfitting  on-board  includes  any  required  outfitting  activity  which  has  not  been  performed 
in  either  of  the  two  previous  stages.  Although  outfitting  on-board  describes  outfitting  as  usually 
practiced,  it  also  allows  for  nontraditional  activities  such  as  the  connection  of  outfit  units  or 
outfitted  blocks. 

Figure  3  illustrates  the  possible  material  flows  among  the  zone  outfitting  stages.  Both  on- 
unit  and  on-block  out>  ting  stages  correspond  to  the  previously  defined  assembly  mode  of  pro¬ 
duction,  although  either  one  could  occur  in  a  shop  facility.  Similarly,  the  on-board  outfitting 
stage  can  correspond  to  either  the  erection  mode  or  the  outfitting  mode  as  previously  defined. 
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Figure  3.  Outfitting  stages 


Clearly,  a  full  implementation  of  the  zone  outfitting  approach  requires  an  outfit  planner  to 
think  in  terms  of  interim  products,  rather  than  systems.  Furthermore,  zone  outfitting  requires 
close  coordination  between  steel  and  outfit  schedules.  Thus,  zone  outfitting  and  PWBS  are 
mutually  reinforcing  approaches  to  ship  production. 


2.2  The  Outfit  Planning  Problem 

Because  zone  outfitting  defines  various  stages  for  outfitting,  it  admits  alternatives  for  the 
execution  of  outfit  activities.  Thus,  the  full  exploitation  of  the  zone  outfitting  concept  requires 
that  production  management  be  able  to  resolve  all  the  alternative  choices  available.  The  prob¬ 
lem  of  choosing  among  the  many  available  outfitting  plans,  which  is  referred  to  here  as  the 
outfit  planning  problem,  is  not  currently  faced  by  production  management  in  U.S.  shipyards, 
simply  because  options  are  not  considered.  The  following  paragraphs  suggest  the  types  of 
options  that  do  exist. 

Although  there  are  options  in  zone  outfitting,  not  every  activity  can  be  performed  in  all 
three  of  the  outfitting  stages.  There  are  some  outfit  components  which  are  only  installed  in  the 
on-board  stage  e.g.,  furnishings  and  other  similar  materials  which  are  subject  to  damage  or 
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pilferage  are  always  installed  in  the  on-board  mode.  These  will  be  designated  on-board  com¬ 
ponents.  Of  the  remaining  components,  some  are  associated  with  distributed  systems,  e.g., 
wireways  or  ventilation  ducting,  rather  than  distinct  units,  e.g.,  pumps,  motors,  valves,  etc. 
These  will  be  referred  to  as  non-unit  components ,  since  outfitting  on-unit  is  not  appropriate. 
Finally,  there  are  the  outfit  components  which  could  be  identified  by  or  associated  with  a 
specific  unit.  These  will  be  referred  to  as  free  components,  since  any  stage  may  be  selected 
(although  on-unit  outfitting  is  preferred). 

Note  that  these  designations  are  fixed  to  some  extent  by  design  practices.  For  example,  a 
given  system  consisting  of,  say,  a  pump  and  piping,  may  be  conceived  and  designed  in  several 
ways.  If  it  is  treated  simply  as  a  collection  of  separate  components  which  must  be  installed  in 
the  ship,  then  the  components  will  have  the  "non-unit"  designation.  Alternatively,  if  the  com¬ 
ponents  are  viewed  as  integral  parts  of  a  single  unit  or  set  of  units,  then  they  will  have  the 
"free"  designation.  Chirillo  and  Jonson  [8]  give  examples  of  outfit  components  that  may  be 
associated  with  units,  although  this  practice  is  not  typical  in  U.S.  shipyards. 

Although  a  free  outfit  component  can  be  associated  with  a  specific  unit,  it  need  not  be 
installed  in  the  on-unit  stage.  The  component  may  instead  be  installed  on-block  or  even  on¬ 
board.  Non-unit  components  may  be  installed  either  on-block  or  on-board,  but  not,  of  course, 
on-unit.  As  indicated  in  the  outfitting  stage  definitions,  units  may  be  installed  either  on-block 
or  on-board.  These  relationships  are  summarized  in  Figure  4  where  a  three-way  distinction  is 
made  between  the  component  type,  its  production  stage,  and  the  production  location. 

Outfit  planning  requires,  for  each  outfit  component,  a  selection  of  outfit  stage.  The  selec¬ 
tion  decisions  are  constrained  by  a  number  of  factors.  In  particular,  it  is  common  practice  to 
take  the  hull  block  erection  schedule  as  fixed  when  planning  the  outfit  activities.  For  example, 
each  hull  block  has  a  fixed  deadline  for  its  completion,  and  at  that  point  in  time  it  is  lifted  onto 
the  ways  for  erection.  Thus,  all  on-block  outfitting  planned  for  that  hull  block  must  be  com¬ 
pleted  before  its  erection  date.  Similarly,  if  a  unit  is  to  be  installed  in  the  block,  all  the  associ¬ 
ated  on-unit  outfitting  must  be  completed  in  time  to  allow  the  unit  to  be  moved  onto  the  block 
and  installed  before  the  block  erection  date.  Furthermore,  if  the  block  "closes  in"  any  previ¬ 
ously  erected  blocks,  any  large  components  (main  engine,  diesel  generators,  etc.)  must  be 
landed  in  these  blocks  prior  to  closing  in. 

The  hull  block  erection  schedule  is  a  constraint  in  outfit  planning  because  of  convention. 
It  is  also  possible  to  treat  the  hull  block  erection  schedule  as  part  of  the  decision  process,  i.e.,  if 
it  were  justifiable,  a  hull  block  might  be  delayed  to  allow  more  on-block  outfitting  to  be  per¬ 
formed.  This  practice  does  not  appear  to  be  in  use  currently  in  the  U.S.,  and  is  not  considered 
in  the  developments  to  follow.  It  is,  however,  common  in  Japanese  shipyards,  and  may  be 
adopted  by  U.S.  yards  in  the  future. 

Another  constraint  which  may  affect  outfit  planning  decisions  in  many  yards  is  the  avail¬ 
able  lifting  capacity.  Outfit  units  and  outfitted  hull  blocks  must  not  exceed  the  safe  lifting  capa¬ 
city  of  the  available  equipment.  Size  is  a  similar  consideration,  i.e.,  units  must  be  sized  in  light 
of  the  available  access. 

The  effect  of  outfit  planning  decisions  on  limited  yard  resources  must  also  be  considered. 
Among  the  resources  to  be  considered  are  labor  and  material  availability  and  production  or 
storage  space.  When  determining  outfit  stages,  care  is  required  to  insure  that  the  resulting  pro- 
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Figure  4.  Relationships  between  outfit  components,  outfitting  stages  and  production  location 


duction  schedule  does  not  call  for  more  labor  than  is  available  in  each  affected  craft  and  grade. 
Likewise,  since  production  typically  requires  space  and  fabricated  components  or  units  may 
need  to  be  stored  temporarily,  the  available  yard  facilities  must  not  be  overcommitted. 

These  resource  allocation  considerations  are  perhaps  the  most  difficult  aspect  of  outfit 
planning,  especially  in  situations  where  multiple  ships  are  in  production  simultaneously.  The 
reason  is  that  in  order  to  guarantee  feasibility  of  the  mode  selections,  a  feasible  schedule  must 
be  determined.  The  selection  decisions  and  subsequent  scheduling  decisions  interact  in  a  com¬ 
plex  fashion  and  cannot  be  made  independently. 
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There  has  been  no  discussion  as  yet  of  the  specific  criteria  by  which  the  outfit  plan  is  to 
be  evaluated.  Several  criteria  may  be  considered,  all  motivated  by  economic  considerations. 
Considerable  cost  savings  are  indicated  16,  8]  for  outfitting  on-unit  and  on-block,  relative  to 
outfitting  on-board.  These  cost  savings  result  from  lower  skill  requirements,  better  material 
access,  less  congestion,  better  quality  control,  etc.  One  criterion,  which  should  be  minimized, 
is  total  cost  of  outfitting. 

Another  result  of  increased  on-unit  and  on-block  outfitting  would  be  reduced  delivery 
time.  Reducing  delivery  time  is  favorable  to  both  owner  and  builder,  since  the  owner  has  use 
of  his  ship  sooner  and  the  builder  receives  final  payment  sooner.  In  addition,  they  both  benefit 
from  the  reduced  ".  .  .  interest  costs  for  the  substantial  accumulating  investment  represented  by 
construction  progress  and  for  achieving  maximum  utilization  of  expensive  facilities  such  as 
building  dock"  (8],  Thus  a  second  criterion,  to  be  minimized,  would  be  completion  time,  in 
particular  circumstances,  yet  other  criteria  might  apply. 

The  outfit  planning  problem  can  now  be  stated  more  precisely  as  follows: 

Given: 

(1)  a  catalog  of  the  outfit  phase  activities  for  which  there  are  outfitting  options; 

(2)  for  each  such  activity,  a  list  of  the  outfitting  options,  including  time,  resource  and 
precedence  requirements; 

(3)  the  ship  delivery  schedule  and  any  fixed  milestone  deadlines; 

(4)  labor  availability  by  craft  and  grade; 

(5)  facility  capacities  and  availabilities  (lifting,  dovered  space,  yard  space,  etc.);  and 

(6)  other  constraining  factors  (material  availability,  rate  of  cost  accumulation,  etc.). 

Determine:  The  outfitting  option  to  be  used  for  each  outfit  phase  activity  considered,  along 
with  the  necessary  schedule. 

The  outfit  planning  problem  is  one  of  selecting  from  a  number  of  interrelated  options,  a  set  of 
options  that  will  satisfy  the  given  resource  constraints  while  optimizing  some  criterion,  such  as 
outfit  costs  or  delivery  date. 

3.  MODEL  FORMULATION 

Modeling  the  problem  involves  the  use  of  activity  network  models  such  as  CPM  or  PERT 
(29,  30,  31,  32,  33].  For  practical  as  well  as  academic  reasons  (see,  e.g.,  (14,  241),  only  deter¬ 
ministic,  i.e.,  CPM-like,  models  will  be  considered. 

The  use  of  deterministic  activity  networks,  or  DANs  (16),  to  model  ship  production 
requires  some  assumptions  about  the  ship  building  process. 

Al:  Ship  specifications,  such  as  production  drawings,  can  be  converted  into  well 
defined,  distinct  work  packages,  or  activities. 
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A2:  Assuming  unlimited  production  resources,  the  only  relationship  between  the 
activities  is  one  of  sequence  or  precedence.  An  activity,  "A,"  precedes  another 
activity,  "B,"  if  "A"  must  be  completed  before  ”B"  can  be  initiated. 

These  two  assumptions  permit  graphical  representation  of  the  relationship  between  production 
activities.  The  one  used  in  this  research  is  the  activity  on  node,  or  AON,  representation  [16]. 

Note  that  assumption  A2  does  not  limit  the  relationships  between  activities  to  precedence 
only.  Other  types  of  relationships  are  possible,  for  example,  two  activities  may  require  the  use 
of  the  same  limited  resource. 

A3:  Associated  with  each  activity  is  information  about  its  duration  (including 
resource-time  options),  about  its  requirements  for  various  resources,  and  about  its 
due  date  (or  completion  deadline)  if  appropriate. 

In  order  to  use  the  DAN  model  in  planning,  it  must  include  certain  information  about  the 
activities  or  work  packages  beyond  precedence  relationships.  At  a  minimum,  each  activity  has  a 
given  duration  and  resource  consumption.  In  addition,  it  is  often  the  case  that  the  activity 
duration  depends  on  the  rate  at  which  resources  are  applied,  i.e.,  there  are  resource-duration 
options.  Start  and  due  dates  are  often  imposed  because  of  special  considerations  beyond  just 
the  work  content  of  the  project,  e  g.,  a  hull  erection  schedule. 

A4:  The  various  resources  required  to  perform  the  activities  are  explicitly  defined  and 
the  availability  of  the  resources  over  the  planning  horizon  is  specified. 

The  resources  required  by  the  activities  can  be  of  two  types.  Some  resouices  are  con¬ 
sumed  as  they  are  applied  to  production,  e.g.,  steel  which  is  applied  to  a  particular  hull  block. 
Any  subcontracted  material  falls  into  this  category.  This  type  of  resource  must  be  available 
when  the  associated  activity  is  scheduled. 

The  other  resource  type  is  available  at  a  certain  rate  rather  than  a  total  amount.  For 
example,  a  given  labor  pool  in  a  particular  craft  translates  into  a  fixed  number  of  man  hours  per 
day  of  that  resource.  Of  course,  over  the  long  run,  the  number  of  man  hours  can  be  changed 
by  changing  the  size  of  the  labor  pool.  Thus,  this  type  of  resource  is  not  "used  up"  in  the  same 
way  that  materials  are. 

Resources  of  this  type  present  more  difficult  planning  problems.  One  reason  is  that  the 
cost  of  the  resource  depends  on  the  rate  at  which  it  is  used,  i.e.,  if  the  resource  is  not  fully  util¬ 
ized  in  some  period,  there  is  a  wasted  resource  cost.  Thus,  one  goal  is  to  schedule  the  produc¬ 
tion  activities  so  that  resources  of  this  type  are  always  fully  utilized. 

The  classical  DAN  models,  such  as  CPM,  are  inadequate  for  the  outfitting  planning  prob¬ 
lem  because  they  are  based  on  the  assumption  of  a  single,  unambiguous  definition  of  the  activi¬ 
ties.  In  contrast,  the  essence  of  the  outfit  planning  problem  is  to.  select  a  particular  activity 
definition  (i.e.,  select  production  modes)  from  among  all  the  available  alternatives.  It  will  be 
necessary  to  extend  the  DAN  models  to  incorporate  this  additional  complexity  and  to  develop 
the  corresponding  extensions  to  the  analytic  methodologies. 
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3.1  An  Activity  Network  Model  of  the  Outfit  Planning  Problem 

The  goal  of  the  following  discussion  is  the  development  of  a  conceptual  model  for  the 
outfit  planning  problem  which  is  consistent  with  an  activity  network  based  approach  to  planning 
and  scheduling.  It  must  be  recognized  at  the  outset  that  the  process  being  modeled  exists  only 
hypothetically  and  that  the  model  does  not  represent  any  existing  process.  It  is  apparently  the 
case  that,  at  the  present  time,  very  few  U.S.  shipyards  employ  activity  network  based  planning 
or  scheduling  procedures  in  production,  thus,  the  proposed  model  constitutes  u  significant 
departure  from  currently  standard  practice.  On  the  other  hand,  it  is  also  apparently  true  that 
interest  in  this  t>.te  of  methodology  is  growing  in  many  U.S.  shipyards,  so  that  the  proposed 
model  is  in  line  with  longer  term  trends  in  the  industry  and  is  based  upon  some  elements 
currently  recognized  in  the  industry. 

3.1.1  Defining  the  Activities 

Current  practice  in  U.S.  production  planning  (111,  appendix  4)  calls  for  work  packages  of 
200-2000  man-hours,  involving  a  single  craft  or  trade.  For  comparison,  the  Japanese  practice 
[8]  is  to  define  work  packages  of  40-120  man-hours.  The  following  developments  are  based  on 
the  premise  that  activity  descriptions  can  be  made  at  the  level  of  the  smallest  fabricated  com¬ 
ponent  and  then  aggregated  as  necessary.  Furthermore,  an  activity,  as  discussed  in  the  previous 
section,  may  consume  different  resources,  i.e.,  it  may  involve  two  or  more  crafts.  The  organi¬ 
zational  and  operational  ramifications  of  this  departure  from  standard  practice  will  be  explored 
later. 

In  developing  the  model,  it  will  be  useful  to  maintain  the  distinction  between  outfit  com¬ 
ponents,  which  are  associated  with  the  outfit  materials,  and  the  outfit  activities,  which  are  associ¬ 
ated  with  production,  i.e.,  fabrication,  assembly  and  installation.  The  outfit  components  were 
categorized  as  on-board,  non-unit,  or  free,  and  outfit  activities  were  classified  as  on-unit,  on- 
block  or  on-board.  The  question  which  follows  from  this  classification  scheme  is,  "How  are  the 
activities  corresponding  to  a  given  outfit  component  defined?" 

A  fundamen'?!  assumption  about  outfit  planning  is: 

A5:  On-ini?  outfitting  is  preferred  to  on-block  outfitting,  which  is  preferred  to  on¬ 
board  outfitting. 

This  assumption  implies  that  if  there  were  no  resource  conflicts,  or  time  constraints,  outfitting 
would  always  be  done  as  early  as  possible  in  the  production  process.  It  is  the  resource  conflicts 
and  milestone  event  deadlines  which  lead  to  deviations  from  this  "ideal"  outfitting  plan. 

Free  Outfit  Components 

The  free  outfit  components  present  the  greatest  latitude  in  planning  production  since  they 
may  lead  to  on-unit,  or  on-block,  or  on-board  production  activities,  or  to  a  combination.  As  a 
consequence,  these  are  the  activities  that  present  the  most  difficulties  in  formulating  the  DAN 
model  of  outfit  planning. 

The  "ideal"  outfitting  plan  would  call  for  maximum  use  of  the  on-unit  stage,  with  resulting 
units  being  installed  whenever  possible.  Thus,  the  following  assumption  is  made: 
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A6:  The  outfit  planning  process  creates  for  the  free  outfit  components,  a  catalog  of 
maximally  outfitted  units.  For  each  unit,  all  the  required  materials,  fabricated 
pieces  and  assembly  work  elements  are  specified.  The  set  of  outfit  work  elements 
for  a  given  unit  will  be  referred  to  as  the  maximum  outfit  set  for  the  unit. 

A  particular  unit  from  this  catalog  will  generate  many  individual  activities  in  the  model  (recall 
that  the  individual  activities  may  be  aggregated  at  a  later  step  in  the  planning  process).  For 
example,  each  individual  component  of  the  unit  must  be  either  fabricated  or  purchased,  result¬ 
ing  in  the  definition  of  either  a  fabrication  activity  or  a  purchasing  activity.  Each  subassembly 
operation  likewise  results  in  the  definition  of  a  distinct  activity. 

An  implicit  requirement  is  that  the  units  in  this  catalog  are  nonoverlapping,  i.e.,  no  free 
outfit  component  is  a  component  of  more  than  one  unit.  Thus,  the  definitions  of  the  units 
themselves  are  considered  as  fixed  at  an  earlier  stage  in  the  planning  process.  The  problem  of 
selecting  from  among  alternative  unit  definitions  is  included  in  the  proposed  model  in  certain 
fairly  restrictive  situations  as  seen  later 

It  may  be  the  case  that  selection  of  the  on-unit  outfitting  rather  than  on-block  or  on-board 
"induces"  additional  work  elements.  For  example,  additional  bracing  may  be  required  to 
prevent  damaging  the  unit  during  handling  and  moving.  Any  such  induced  work  must  be 
reflected  in  additional  activities  in  the  DAN  model. 

Since  the  ideal  outfitting  plan  may  not  be  feasible  given  the  available  resources  and  mile¬ 
stone  event  deadlines,  it  is  necessary  to  specify  the  alternatives  to  be  allowed  within  the  outfit 
planning  model. 

A7:  For  each  unit,  there  is  a  set  of  outfit  components  which  represents  the  least 
amount  of  on-unit  outfitting  that  can  be  done  and  still  be  economically  justifiable. 
The  associated  set  of  outfit  work  elements  will  be  called  the  minimum  outfitting  set 
for  the  unit. 

This  assumption  implies  that  if  a  particular  unit  is  selected  from  the  catalog  for  on-unit 
outfitting,  it  need  not  be  completely  outfitted.  However,  it  will  include  at  least  those  outfit 
work  elements  contained  in  its  minimum  outfitting  set.  Associated  with  the  maximum  and 
minimum  outfit  sets  are  related  sets  of  outfit  components,  designated  as  the  maximum  and 
minimum  outfit  kits. 

A  given  unit,  may  be  assembled  in  the  on-unit  mode.  If  so,  it  must  include  all  com¬ 
ponents  in  its  minimum  outfit  kit  and  it  may  include  any  additional  components  in  its  max¬ 
imum  outfit  kit.  Any  work  elements  from  the  maximum  outfit  set  which  are  not  selected  for 
on-unit  outfitting  must  be  performed  at  a  subsequent  stage,  i.e.,  either  on-block  or  on-board. 
The  outfitted  unit  itself  also  may  be  installed  on-block  or  on-board.  If  installed  on-block,  its 
assembly  and  installation  must  be  completed  before  the  block  erection  deadline. 

These  possibilities  can  be  incorporated  in  a  CPM-like  precedence  diagram  as  illustrated  in 
Figure  5.  In  this  example,  nodes  1-4  represent  the  purchase,  fabrication,  or  subcontracting 
activities  for  outfit  components  in  the  maximum  outfitting  kit  for  some  unit.  The  components 
corresponding  to  nodes  2,  3  and  4  are  in  the  minimum  outfitting  kit  for  the  unit,  i.e.,  at  least 
these  components  must  be  included  if  the  unit  is  selected  for  fabrication. 
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Figure  5.  Activity  model  for  free  outfil  components 


A  component,  such  as  the  one  corresponding  to  node  1  in  the  figure,  which  is  in  the 
maximum  outfitting  kit  has  the  following  characteristic.  It  is  a  component  which  could  be 
included  in  the  unit  fabrication  and,  in  fact,  it  would  be  desirable  to  include  it.  However,  if 
there  are  frustrating  circumstances,  for  example,  insufficient  fabrication  lead  time  or  insufficient 
resources  (labor,  equipment,  or  material),  then  such  a  component  may  be  left  off  the  unit.  It 
is,  in  a  sense,  an  auxiliary  component  of  the  unit.  On  the  other  hand,  components  in  the 
minimum  outfitting  kit  are  considered  essential  to  the  unit,  so  much  so  that  they  cannot  be 
omitted  from  the  unit. 

The  activities  represented  in  the  diagram  by  square  nodes  are  the  ones  subject  to  the 
outfit  planning  decisions,  which  designate  the  specific  stage  of  outfitting  for  each  component. 

To  insure  that  components  produced  by  the  first  four  activities  in  Figure  5  are  actually 
installed,  the  outfit  planning  decisions  must  obey  the  following  guidelines: 

(1)  Exactly  one  activity  is  selected  from  each  of  the  sets: 

(6.7.12)  to  insure  that  component  1  is  included; 

(5.8.13)  to  insure  that  component  2  is  included; 

(5.9.14)  to  insure  that  component  3  is  included; 

(5,10,15)  to  insure  that  component  4  is  included; 

If  activity  5  is  selected,  activities  8,  9,  10,  13,  14,  and  15  cannot  be  selected. 
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(2)  Activity  6  can  be  selected  only  if  activity  5  is  selected; 

(3)  If  activity  5  is  selected,  then  either  11  or  16  must  be  selected;  if  5  is  not  selected, 
neither  1 1  and  16  can  be. 

If  these  three  guidelines  are  followed,  then  a  feasible  solution  will  be  constructed  for  the  outfit 
planning  problem.  Note  that  if  an  activity  is  not  selected,  it  simply  becomes  a  discarded  option, 
i.e.,  it  does  not  affect  subsequent  scheduling  or  resource  allocation  decisions. 

The  example  illustrates  additional  details  that  can  be  incorporated  in  this  type  of  model. 
For  example,  if  in  addition  to  the  minimum  outfitting  kit,  component  1  is  also  to  be  included  in 
the  on-unit  outfitting  (i.e.,  activity  6  is  selected)  the  associated  work  element,  activity  6,  must 
be  completed  by  the  time  the  unit  is  installed,  either  on-block  (activity  11)  or  on-board 
(activity  16).  This  is  indicated  by  the  precedence  relationships  (5,  6),  (6,  11)  and  (6,  16). 

In  this  example,  there  is  a  required  sequence  for  installing  the  outfit  components,  com¬ 
ponent  2  cannot  be  installed  until  after  components  3  and  4  have  been  installed,  and  com¬ 
ponent  1  cannot  be  installed  until  after  component  2  has  been  installed.  These  restrictions  are 
satisfied  by  requiring  that  activity  13  has  as  its  predecessors,  either  9  or  14,  and  either  10  or  15. 
Similarly,  activity  12  has  as  predecessors  either  8  or  13.  Note  also  that  if  activity  5  is  selected 
(i.e.,  the  unit  is  assembled)  then  on-block  or  on-board  outfitting  for  component  1  must  follow 
installation  of  the  unit. 

Finally,  note  that  the  block  erection  schedule  can  be  introduced  into  the  model  simply  by 
specifying  a  due  date  for  the  unnumbered  node  corresponding  to  block  erection.  One  addi¬ 
tional  consideration  was  left  out  to  simplify  the  figure  and  the  discussion.  It  might  be  desirable 
to  treat  on-board  outfitting  as  two  distinct  stages,  one  corresponding  to  prefloat  off  outfitting 
and  one  corresponding  to  wet-dock  outfitting.  This  consideration  could  be  affected  within  the 
model  simply  by  defining  four  additional  nodes,  one  for  each  of  the  four  outfit  components, 
and  adding  the  necessary  precedence  relationships.  This  is  illustrated  for  the  previous  example 
in  Figure  6. 

Non-U  nit  Ou(fit  Components 

The  non-unit  outfit  components  involve  fewer  production  options  than  the  free  outfit 
components  and  it  is  therefore  considerably  easier  to  define  the  alternative  activities  generated 
by  them.  In  fact,  non-unit  components  generate  a  subset  of  the  activities  generated  by  free 
components.  For  instance  referring  to  the  example  of  Figure  5,  suppose  the  on-unit  outfitting 
activities,  which  are  activities  5,  6,  11,  and  16,  are  omitted.  The  resulting  activity  network 
would  describe  the  options  available  for  non-unit  components  1-4. 

In  addition  to  sequencing  requirements  among  the  non-unit  components,  there  may  also 
be  sequencing  requirements  between  the  non-unit  components  and  certain  free  components  or 
their  associated  units.  The  various  types  of  relationships  are  summarized  in  Figure  7.  As  indi¬ 
cated  in  the  figure,  the  model  must  account  for  the  possibility  of  sequencing  requirements 
between  the  non-unit  components  and  certain  free  components  or  their  associated  units,  as  well 
as  between  the  non-unit  component  and  certain  on-board  components. 

As  with  the  free  outfit  components,  it  is  conceptually  easy  to  extend  the  model  to  allow 
two  distinct  on-board  outfit  stages.  The  illustration  will  not  be  repeated. 
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On-Board  Outfit  Components 

The  on-board  outfit  components  require  no  outfitting  mode  decision,  unless  the  possibility 
of  two  on-board  stages  (pre-float  off  and  wet-dock)  are  allowed.  In  this  case,  each  component 
generates  two  alternative  outfitting  activities  with  precedence  relationships  as  shown  in  Figure 
8.  The  requirement,  then,  is  to  select  exactly  one  of  the  two  activities. 


Figure  8.  Activity  model  for  on-board  components 


3.1.2  Defining  the  Decisions  and  Constraints 

The  fundamental  decisions  required  in  the  outfit  planning  problem  is  the  resolution  of  the 
options  associated  with  each  outfit  component.  This  selection  decision  considers  the  activity  net¬ 
work  and  requires  a  choice  of  exactly  one  of  the  alternative  outfit  activities  for  each  outfit  com¬ 
ponent  (and  perhaps  the  resulting  unit).  The  selection  decision  must  satisfy  the  sequencing 
requirements  which  are  represented  in  the  activity  network  as  arrows.  The  sequencing  require¬ 
ments  are  constraints  on  the  selection  decision. 

If  there  were  no  other  constraints,  the  selection  decision  would  be  trivial  because  <>t 
assumption  A5,  i.e.,  each  component  would  be  outfitted  as  early  as  possible  in  the  production 
process.  There  are,  however  three  major  types  of  constraints  which  may  be  violated  b\  sue  h  .< 
selection: 

(1)  [Time]  The  sequencing  requirements  may  lead  to  a  longer  production  . 

some  stage  than  is  available  from  the  given  block  erection  and  flo.ii  f  > 
stones. 
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(2)  [Labor]  Even  if  there  is  sufficient  time,  the  activities  selected  to  be  performed 
between  two  milestones  may  require  more  labor  hours  than  are  available  in  the 
crafts. 

(3)  [Weight  and  Size]  Even  if  there  is  sufficient  time  and  labor,  the  number  of  com¬ 
ponents  selected  for  a  unit  or  the  number  of  units  and  components  selected  for  a 
block  may  lead  to  a  unit  or  block  which  is  too  large  for  the  available  facilities  or 
access. 

The  constraints  on  time  and  weight  may  be  easily  checked  once  the  selection  decision  is  known. 
Such  is  not  always  the  case,  however,  for  the  labor  availability  constraints. 

In  order  to  know  whether  or  not  a  labor  availability  constraint  is  satisfied,  a  schedule  for 
the  activities  must  be  specified.  Thus,  in  situations  where  labor  availability  is  a  limiting  factor, 
solving  the  outfit  planning  problem  requires  making  a  scheduling  decision  in  addition  to  the 
selection  decision. 

The  scheduling  decision  by  itself  is  an  extremely  complex  one.  In  fact,  given  the  selec¬ 
tion  decision,  the  problem  to  be  solved  in  making  the  scheduling  decision  is  a  "resource  con¬ 
strained  CPM  problem,"  [S,  12,  49,  SO],  At  the  present  time  there  is  no  optimization  algorithm 
capable  of  solving  large  instances  of  this  type  of  problem  (see  Bennington  and  McGinnis  [S]> 
and  based  on  recent  results  in  combinatorics  ([28],  [44]),  there  is  little  hope  that  such  an  algo¬ 
rithm  is  possible.  Thus,  if  solving  the  outfit  planning  problem  requires  a  specific  scheduling 
decision,  any  practical  solution  methodology  will  be  heuristic  in  nature. 

3.1.2.3  Defining  the  Criteria 

The  final  step  in  formulating  a  model  of  the  outfit  planning  problem  is  to  define  the  cri¬ 
teria  by  which  solutions  are  to  be  evaluated.  The  problem  of  evaluation  is  complicated  by  the 
fact  that  there  are  two  distinct  kinds  of  decisions  being  made:  outfitting  stage  selection  and 
activity  scheduling.  Furthermore,  a  number  of  different  viewpoints  could  be  considered,  each 
leading,  possibly,  to  a  different  criterion. 

The  viewpoint  adopted  here  is  that  the  outfit  planning  problem  is  to  be  solved  in  the  con¬ 
text  of  a  number  of  prior,  exogenous  decisions  which  fix  many  of  the  outfit  planning  problem 
parameters.  For  example,  the  milestone  event  times  (such  as  lay  keel,  float  off,  delivery,  etc.) 
are  assumed  fixed,  along  with  the  detailed  block  fabrication  and  erection  schedule.  (Note,  how¬ 
ever,  that  this  analytic  framework  could  be  used  in  deciding  on  the  appropriate  milestone 
schedule.)  Resource  availabilities  are  considered  as  exogenous  factors. 

Within  the  environment  resulting  from  these  exogenous  factors,  the  goal  in  outfit  plan¬ 
ning  is  narrowed  to  that  of  minimizing  the  cost  of  outfitting.  Conceptually,  then,  all  that  is 
required  is  to  estimate  the  outfitting  cost  associated  with  each  of  the  outfitting  alternatives.  The 
best  outfitting  plan  is  the  one  with  the  smallest  total  cost.  The  goal  of  the  scheduling  com¬ 
ponent  of  outfit  planning  is  to  maximize  labor  utilization.  This  is  accomplished  when  there  are 
no  periods  in  which  the  scheduled  work  content  is  less  than  the  available  labor. 
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While  these  two  criteria  are  conceptually  simple,  their  application  may  be  difficult.  In  the 
first  place,  they  require  a  significant  effort  in  detailed  estimation.  The  labor  content,  material 
and  overhead  costs,  and  duration  must  be  estimated  for  each  of  several  alternative  outfit 
methods  for  a  large  number  of  outfit  components.  Current  practice  may  not  require  such  a 
detailed  estimate  for  even  one  alternative.  Clearly,  procedures  and  methods  will  need  to 
developed  for  aggregating  outfit  components  in  the  activity  network  and  for  semi-automating 
the  estimation  at  the  necessary  level  of  detail.  The  information  required  for  this  estimation 
process  will  have  to  be  accumulated  over  time  as  there  is  more  experience  with  on-unit,  on- 
block,  and  on-board  outfitting. 

A  preliminary  and  crude  approach  to  the  first  criterion  is  the  following.  Assume  that  the 
savings  to  be  realized  by  outfitting  earlier  in  the  production  process  is  a  constant  fraction  of  the 
total  cost  to  outfit  on-board.  The  fraction  could  vary  with  the  type  of  outfitting  (e.g.,  electrical 
vs.  hydraulic  systems),  or  with  the  total  cost  of  the  outfitting  activity  or  some  other  factor.  The 
criterion  then  becomes  one  of  maximizing  the  total  savings  over  outfitting  completely  on-board. 

3.2  Model  Evaluation 

A  mathematical  model  has  been  developed  to  describe  the  outfit  planning  problem  and 
appears  in  Appendix  A.  This  model  is  in  the  form  of  a  mixed  integer  programming  problem 
and,  consequently,  it  presents  formidable  difficulties  in  solution.  In  fact,  recent  theoretical 
development  [28]  have  been  interpreted  as  indicating  that  such  problems  (referred  to  as  "NP- 
complete")  cannot  be  solved  optimally.  Certainly  it  is  true  that,  currently,  practical  problems  of 
this  ilk  are  not  optimized.  There  are,  however,  a  number  of  heuristic  solution  procedures 
which  have  been  developed  and  used  successfully  to  solve  similar  problems  (e.g.,  see  [38]). 

Obviously,  the  model  by  itself  cannot  lead  to  better  outfit  planning.  What  is  required  is  a 
systematic  implementation  of  the  model.  There  are  several  requirements  for  a  successful 
implementation  of  the  model,  and  these  can  be  more  easily  discussed  by  referencing  the 
diagram  of  Figure  9. 

One  of  the  requirements  for  a  successful  implementation  is  an  appropriate  methodology 
for  solving  the  selection  and  scheduling  problem  for  given  milestone  events  and  resource 
availabilities.  As  was  indicated  earlier,  there  is  little  hope  for  a  general  optimizing  method  for 
solving  this  problem,  so  in  the  most  general  case,  the  solution  procedure  will  be  heuristic. 

The  model  requires  large  amounts  of  information  and  generates  large  numbers  of  detailed 
decisions.  Thus,  any  practical  implementation  will  require  a  fairly  detailed,  production  oriented 
date  base  to  support  the  solution  procedure.  Although  many  shipyards  do  not  have  such  a  data 
base  at  the  current  time,  the  SPARDIS  system  used  by  NASSCO  [45]  is  one  example  of  the 
type  of  system  that  would  be  required. 

A  third  requirement  is  that  the  outfit  planning  process  could  in  fact  provide  all  the 
information  required  in  the  model.  It  appears  that  a  major  shift  from  current  practice  would  be 
the  idea  of  allowing  (and  therefore  planning  for)  several  alternative  ways  to  accomplish  the 
outfitting  tasks.  In  addition,  the  use  of  the  on-unit,  on-block,  on-board  approach  to  outfitting  is 
not  currently  widespread,  although  it  is  being  strongly  supported  as  a  means  for  improving  pro¬ 
ductivity  (8). 
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■  Hunan 
Component 


■  Process 


"  Data  Base 


Figure  9.  Outfit  planning  process 


Given  that  the  zone  outfitting  approach  has  been  adopted,  defining  the  alternative  outfit 
activities  discussed  earlier  should  be  straight-forward,  albeit  somewhat  time-consuming. 
Observe  that  to  a  large  degree,  the  outfit  elements  are  associated  with  particular  blocks.  There¬ 
fore,  an  activity  network  resembles  a  large  number  of  small  subnetworks  (one  for  each  block) 
which  are  loosely  connected  by  milestone  events.  It  will  be  possible  to  ’decompose*  the  net¬ 
work  definition  into  smaller,  more  manageable  tasks. 

Figure  9  also  indicates  how  the  model  might  be  used  in  practice.  The  use  of  the  model 
for  planning  the  outfitting  of  a  ship  as  self-evident.  Probably  as  important  is  the  use  of  the 
model  to  "replan*  when  there  are  major  deviations  from  the  original  plan,  e.g.,  due  to  weather, 
change  orders,  priority  repair  work,  etc. 

A  final  point  of  discussion  is  the  benefit  to  be  obtained  by  the  use  of  the  model.  The 
foremost  benefit  of  the  model,  per  se,  is  tighter  planning  and  control  of  outfitting,  resulting  in 
higher  productivity  (and  thus  lower  costs).  In  project-type  work,  such  as  ship  production,  it  is 
important  to  correctly  estimate  the  labor  content  of  the  work  and  then  plan  the  work  so  that 
labor  resource  utilization  is  maximized.  The  proposed  model  provides  a  systematic  means  for 
coping  with  and  coordinating  the  vast  number  of  relationships  which  simply  cannot  he  handled 
by  an  unaided  human  planner. 

A  secondary  benefit  from  the  proposed  model  is  that  it  complements  and  strengthens  the 
implementation  of  the  on-unit,  on-block,  on-board  approach  to  outfitting.  It  provides  a  sys¬ 
tematic  framework  for  identifying  opportunities  for  on-unit  and  on-block  outfitting  as  well  as 
for  determining  the  technical  and  economic  feasibility  of  outfitting  plans. 
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APPENDIX  A:  THE  MATHEMATICAL  MODEL 

In  developing  the  conceptual  model,  two  types  of  decisions  were  identified:  selection 
decisions  and  scheduling  decisions.  It  will  be  convenient  to  formalize  the  selection  decisions 
first.  Associate  with  each  outfit  component  an  index,  i,  where  i  -  1,  2,  ....  N,  N  being  the 
total  number  of  outfit  components.  Similarly,  associate  with  each  outfit  unit  an  index 
j  -  1 . Af,  and  with  each  block  an  index  6-1,  . . .  ,  B. 

The  selection  decisions  will  be  represented  by  indicator  variables.  For  a  particular  com¬ 
ponent,  i,  the  variables  are: 

(1  if  component  i  is  outfit  on- unit 
0  otherwise 

1  if  component  i  is  outfit  on— 6 lock 
Xi  ”  0  otherwise 

1  if  component  i  is  outfit  on-board 
x*  -  (in  the  Aull) 

0  otherwise 

Exactly  one  of  the  indicator  variables  must  equal  one  for  any  component.  However,  not  all 
stages  can  be  selected  for  each  element.  Therefore,  group  the  indices  as  follows: 

F  -  set  of  indices  of  free  outfit  components 

N  —  set  of  indices  of  nonunit  outfit  components 

B  -  set  of  indices  of  on-board  outfit  components 

These  sets  are  pairwise  disjoint.  Now  the  component  selection  decisions  must  satisfy: 


(1) 

x,“  +  xf  +  x/1  -  1 

/ «  F 

(2) 

xf  +  x/1-  1 

i  c  N 

(3) 

xf  —  1 

it  B. 

Note  that  there  is  only  one  on-board  option.  The  model  can  be  readily  extended  to  allow  for 
pre-float  off  and  wet-dock  on-board  outfitting.  In  order  to  simplify  the  exposition,  this  exten¬ 
sion  is  not  included. 

There  are  similar  indicator  variables  associated  with  each  unit: 

{1  if  unit  j  is  selected  for  assembly 
0  otherwise 

b  ll  if  unit  J  is  installed  on-block 
yJ  "  |0  otherwise 
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y} 


|l  if  unit  J  is  installed  on-board 
[0  otherwise 


Since  a  unit  cannot  be  installed  unless  it  is  first  assembled,  the  unit  selection  variables  must 
satisfy  the  following  constraint: 

(4)  yf  +  yjh-  Zj-0  V  j. 

The  unit  selection  decisions  and  element  selection  decisions  must  be  tied  together. 
Define  the  following  index  sets: 

L  ( j )  -  set  of  indices  of  components  in  the  minimum  outfitting  kit  for  unit  j 

M(J )  -  set  of  indices  of  components  in  the  maximum  outfitting  kit  for  unit  j 
The  element  and  unit  selection  variables  must  satisfy: 

(5)  T  */- IU.(/)IU/-0  Vj 
uTXj ) 

(6)  T  xr-  \\M{J)\\zj  4  0  Vj 

where  l|S||  is  the  number  of  elements  of  the  set  £ 

Constraint  (5)  requires  that  if  unit  j  is  selected  (zj  -  1),  then  all  the  components  in  the 
minimum  outfitting  kit  for  that  unit  also  must  be  selected.  Constraint  (6)  permits  additional 
components  to  be  included  in  the  unit  only  if  the  unit  is  fabricated. 

The  constraints  CD- (6)  are  logical  constraints  and  merely  guarantee  consistency  between 
the  indicator  variables  and  the  decisions  they  represent.  In  addition,  there  are  structural  con¬ 
straints  which  must  be  satisfied.  One  of  these  is  the  precedence  relationships  defined  by 
sequencing  requirements.  Define 

P(J )  -  index  set  of  components  (units)  which  must  precede  component  (unit)  J  in 
production. 

Then  the  precedence  constraints  on  the  selection  decisions  are: 

(7)  xj1  +  2x?  +  3x r-  Or/  +  2xf  +  3x/)  <0  I  <  P(J) 

(8)  yf  +  2 >/ -  (y*4  +  2y£)  <  0  J,  P(k ) 

Constraints  (7)  and  (8)  require  that  for  any  component  or  unit,  its  predecessors  must  be 
outfitted  or  installed  at  the  same  or  an  earlier  production  stage. 

A  second  category  of  structural  constraints  limits  the  total  weight  added  to  a  unit  or  block. 
Note  that  these  limits  may  be  facility  dependent,  i.e.,  units  fabricated  in  different  shops  may 
have  different  weight  limits. 

(9)  T  w,xT<  Wj  Vj 

i*mj) 

00)  T  |  T  »,jAy}+  T  w,x/<  Wb  Vb 

JtUli)  1/tWO)  J  HHbnjN(b) 
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where 


w,  -  weight  added  by  outfit  element  / 

Wj  -  maximum  weight  allowed  for  unit  j 

Uib)  -  units  which  go  into  block  b 

Nib)  -  subset  of  components  of  N  which  go  into  block  b 

Fib)  -  subset  of  componnts  of  F which  go  into  block  b 

Wb  -  maximum  outfitting  weight  added  to  block  b. 

The  first  term  in  constraint  (10)  is  the  total  weight  of  units  which  are  selected  for  installation 
on-biock.  The  second  term  is  the  total  weight  of  components  (not  part  of  a  unit)  which  are 
outfitted  on-block. 

In  order  to  deal  with  the  time  and  labor  availability  constraints,  the  scheduling  .lecisions 
must  be  formalized.  Define  the  following  scheduling  variables: 

t,  -  scheduled  start  time  for  component  /outfitting 

Oj  -  scheduled  time  for  completing  unit  j  fabrication 

Tj  —  scheduled  start  time  for  unit  J  installation  . 

The  scheduling  variables  must  satisfy  all  the  precedence  constraints  as  well  as  the  scheduling 
limitations  imposed  by  the  steel  schedule. 

First,  consider  the  constraints  involving  on-unit  outfitting. 

(11)  ttxf  +  dfxf -  tjxJ  <0  it  Pij) 

where  </?*-  time  to  outfit  component  /  on  unit.  Constraint  (11)  requires  that  all  predecessors 
of  component  j  must  be  completed  before  component  yean  be  outfitted  on  unit. 

(12)  t,xf  +  dfxf  -  BjZj  <  0  V  i  *  M  0 ) 

Constrain!  (12)  requires  all  on-unit  outfitting  to  be  completed  before  the  unit  itself  is  com¬ 
pleted.  ' 

(13)  +  dj  -  tj  <  0  V  j 

where  dj  —  material  handling  delay  for  unit  j. 

Constraint  (13)  is  included  to  allow  for  possible  significant  material  handling  delay  or  resource 
requirement. 

The  installation  of  units  and  outfit  components  on-btock  must  not  only  satisfy  precedence 
but  "schedule  window”  constraints  as  well. 
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(14)  r jyf  +  dfyf  -  rky£  <  0  jt  Pik) 

where  df  is  the  time  required  to  install  unit  j  on-block. 

Constraint  (14)  forces  the  on-block  installation  of  unit  k  to  be  after  the  on-block  installation  of 
its  predecessors. 

(15)  Tj  -  rb>0  Vj  c  U(b) 

where  7}  -  earliest  possible  time  for  on-block  outfitting  on  block  b. 

Constraint  (15)  forces  the  installation  of  the  unit  j  to  be  after  the  time- when  installation  is 
feasible. 

(16)  rjyf  +  d}y}  -  T(  <  °  V  j  c  U  (b ) 

where  T{  -  latest  possible  time  to  complete  outfitting  on  block  b. 

Constraint  (16)  sets  the  deadline  for  on-block  installation  of  units. 

There  are  similar  precedence  and  schedule  window  constraints  for  the  on-block  outfitting 
of  free  and  nonunit  components: 

(17)  /,*,»  +  ~  tjXj  <  0  j  «  />(/) 

(18)  V-  V,  x?  >  0  it  Fib )  JN(b),  V  b 

(19)  V  +  4V  ~  T{  <  0  it  Fib )  U  Nib),  V  b. 

These  same  precedence  and  schedule  window  constraints  are  repeated  for  both  units  and  ele¬ 
ments  for  on-board  outfitting.  For  the  units,  the  constraints  are: 

(20)  r jyf  +  dfyf  -  rky £  <  0  j  $  Pik) 

(21)  t j  -  rhyjh  >  0  V  j 

where  7J  —  earliest  possible  time  for  installing  unit  on-board. 

(22)  Tjyf+dfy}-  7]f <  0  Vj 

where  T{  -  latest  possible  time  for  installing  unit  on-board. 

For  the  outfit  components,  the  corresponding  constraints  are: 

(23)  V  +  4V "  tj*J  <  0  jt  Pii) 

(24)  t,xf-  rkx?>  0  Vi 

(25)  trf+dfxf-  7j(<0  Vi. 

In  addition  to  precedence  and  schedule  window  constraints,  the  scheduling  decisions  must 
be  feasible  with  regard  to  the  resource  availabilities.  Resource  availability  constraints  are  quite 
difficult  to  formulate  in  explicit  terms,  so  the  following  approach  is  typically  used  (see,  e.g., 
models  in  IS]  and  (11)).  Define  the  following: 
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Ae(t)  —set  of  outfit  components  being  outfitted  at  time  t 
Au(t)  -set  of  outfit  units  being  installed  at  time  t 

rkM  “  level  of  resource  category  c  required  by  component  i  when  outfitted  on-unit 

rkb  -  level  of  resource  category  c  required  by  component  i  when  outfitted  on-block 

rkh  -  level  of  resource  category  c  required  by  component  i  when  outfitted  on-board 

rtf  -  level  of  resource  category  c  required  to  fabricate  unit  j 

rjtb  "  level  of  resource  category  c  required  to  install  unit  j  on-block 

rjch  -  level  of  resource  category  c  required  to  install  unit  j  on-board 

Rc,  -  level  of  resource  category  c  available  at  time  t 


Now  the  resource  availability  constraints  are: 

T  ('/«,*"  +  rk bx?  +  r^xf)  + 

(26)  X  (fjcfZj  +  tjarf  +  r^yp  <  Rcl. 

The  difficulty  with  using  such  a  constraint  is  that  the  sets  Ae(t )  and  Au(t)  depend  on  the 
scheduling  decisions.  In  fact  this  is,  to  a  certain  degree,  the  nub  of  the  resource  constrained 
project  scheduling  problem. 

The  constraints  (l)-(26)  can  be  shown  to  be  redundant.  For  example,  if  the  scheduling 
related  precedence  constraints,  (ll)-(25),  are  satisfied,  then  the  selection  related  precedence 
constraints,  (7)  and  (8),  must  necessarily  be  satisfied.  The  reason  for  including  the  redundant 
constraints,  (7)  and  (8),  is  to  allow  for  solution  procedures  which  try  to  decouple  the  selection 
and  scheduling  decisions. 

Since  the  criterion  specified  for  the  outfit  planning  problem  is  to  minimize  outfitting  costs, 
define: 

C*  —  cost  of  outfitting  element  /  on-unit 
C/()  -  cost  of  outfitting  element  i  on-block 
CM  -  cost  of  outfitting  element  t  on-block 
Cjt  -  cost  to  install  unit  j  on-block 
Cjt,  -  cost  to  install  unit  J  on-board. 


The  objective  function  for  the  mathematical  model  is: 

Minimize  £  (C**,1'  +  C*jr,*+  C* xfl  +  £  I CAy/+  C^/l. 

'  j 
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ABSTRACT 

Regression  analysis  is  used  10  estimate  the  effects  on  Navy  enlistments  of 
recruiters,  advertising,  unemployment  and  other  factors.  In  measuring  the 
effects  of  these  factors  on  enlistments,  changes  in  both  the  demand  for  and 
supply  of  en  .stmems  are  taken  into  account.  Advertising  is  treated  as  a  capital 
investment;  it';  effect  is  estimated  using  a  maximum  likelihood  technique  that 
was  developed  for  measuring  the  effects  of  capital  investments.  We  find  strong 
evident  that  recruiters  and  unemployment  increase  enlistments.  Advertising 
also  seems  to  increase  enlistments,  but  its  effect  is  highly  uncertain.  The 
results  suggest  that  increases  in  recruiting  resources  would  have  eliminated  the 
enlistment  shortfalls  experienced  by  the  Navy  in  FY  1978-79. 


INTRODUCTION 

Navy  recruiting  efforts  were  largely  successful  unitl  FY  1978.  Since  the  advent  of  the 
All- Volunteer  Force  (AVF)  in  1973,  the  Navy  frequently  achieved  its  enlistment  goals  or  only 
slightly  fell  short.  However,  shortfalls  averaged  10  percent  in  FY  1978  and  FY  1979,  and  they 
are  expected  to  increase  further  in  the  1980s  because  of  a  decline  in  the  youth  population. 

Similar  recruiting  problems  have  also  been  experienced  by  the  other  Services.  As  a 
result,  some  view  the  AVF  as  a  failure  and  urge  that  we  return  to  the  Draft.  But  a  draft 
involves  many  hidden  social  costs  and  inequities  which  make  it  an  undesirable  option.* 
Instead,  solutions  consistent  with  an  AVF  are  likely  to  be  less  costly  to  society,  if  not  the 
Defense  Department’s  manpower  budget,  and  such  solutions  should  be  chosen. 

The  Services’  major  difficulty  has  been  an  inability  to  recruit  enough  male  high  school  gra¬ 
duate  enlistments  (HSGs).  One  set  of  AVF  solutions  involves  options  that  would  reduce 
demand  for  them.  This  could  be  accomplished,  for  example,  by  substituting  for  male  HSGs, 
nonHSGs  and  HSGs  in  the  lower  mental  groups  as  well  as  women  and  civilians. 

Another  set  of  options  would  increase  the  supply  of  HSGs.  One  frequently  recommended 
by  economists  is  to  raise  military  pay  and  benefits,  but  this  is  a  costly  option:  higher  pay  must 
be  given  to  the  entire  stock  of  manpower  as  well  as  to  new  recruits;  enlistment  bonuses  are  a 
more  selective  instrument,  but  they  too  must  be  paid  to  a  large  population  (all  enlistees). 

Another  option  is  to  increase  the  amount  of  resources  devoted  to  recruiting,  i.e.,  recruiters 
(R)  and  advertising  (A).  This  approach  may  be  less  expensive  than  raising  pay  or  giving 

*For  discussion  see  Reference  (61,  Appendix  M. 
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bonuses  because  it  eliminates  extra  payments  or  rents  to  those  who  would  volunteer  anyway. 
Unfortunately,  there  is  little  evidence  on  the  productivity  of  R  and  A,  so  it  is  difficult  to  esti¬ 
mate  the  cost  of  using  more  to  reduce  shortfalls.  To  help  evaluate  such  an  alternative,  this 
study  analyzes  the  effects  of  recruiters  and  advertising  on  Navy  HSGs. 


Previous  Studies 

There  have  been  numerous  studies  on  enlistments.*  In  these  regression  analysis  was  used 
to  relate  the  enlistment  rate,  e.g.,  HSGs  per  capita,  to  factors  such  as  pay  and  unemployment. 

Typically,  it  is  assumed  that  HSGs  are  limited  solely  by  supply  factors  and,  therefore,  the  flow 
of  HSGs  is  independent  of  demand  considerations.  But  this  is  a  questionable  assumption 
because  the  demand  for  enlistments  seems  to  have  had  a  strong  effect  on  HSGs.  As  illustrated 
in  Figure  1,  there  is  a  strong  relationship  over  the  period  FY  1972-78  between  HSGs  and  the 
demand  for  total  enlistments,  i.e.,  quotas.  The  relationship  seems  to  hold  for  all  HSGs  as  well 
as  those  in  the  upper  mental  groups  (MGl-3U).t  HSGs  seem  to  be  strongly  affected  by  the 
demand  for  manpower.*  Thus,  we  observe  a  reduced  form  equation  for  HSGs  rather  than  a 
supply  function  as  assumed  by  previous  researchers. 

What  might  cause  a  positive  association  between  quotas  and  HSGs?  Perhaps  recruiters 
simply  worked  less  hard  when  faced  with  low  quotas.  Such  a  response  seems  reasonable  as  long 
as  recruiting  goals  are  achieved,  which  was  largely  the  case  prior  to  FY  1978. 

In  FY  1978,  however,  there  were  large  shortfalls;  as  a  result  we  presume  a  high  level  of 
recruiter  effort  —  yet  there  were  relatively  few  HSGs.  Thus,  low  recruiter  effort  does  not 
explain  why  there  were  few  HSGs  in  FY  1978.  Still,  changes  in  recruiter  effort  might  partially  j 

explain  Figure  1  in  earlier  years. 

A  relationship  between  quotas  and  standards  may  better  explain  Figure  1  as  well  as  the 
FY  1978  results.  The  Navy  has  tended  to  lower  standards  to  reduce  shortfalls  and  raise  them 
when  faced  with  an  excess  supply  of  applicants.§  For  example,  when  faced  with  high  quotas 
and  the  prospect  of  large  shortfalls  in  FY  1972,  standards  were  lowered  enabling  more  high 
school  graduates  in  the  lower  mental  groups  to  enlist  in  the  Navy.  - 

As  quotas  changed,  changes  were  brought  about  in  certain  supply  factors.  Whatever  they 
are,  recruiters'  efforts  or  standards,  it  is  necessary  to  control  for  their  effects  to  measure  the 
productivity  of  R  and  A.** 


•For  examples,  see  Reference  [1-31,  15-15]. 

t There  are  five  mental  groups.  The  upper  mental  groups  are  one,  two,  and  the  upper  half  of  three. 

(The  Navy's  demand  for  manpower  is  essentially  independent  of  the  supply  of  HSGs.  Quotas  are  based  on  projections 
of  total  manpower  requirements  rather  than  on  projections  of  supply  factors  such  as  unemployment  rates. 

(For  evidence  on  the  relationship  between  quotas  and  standards,  see  Reference  [151,  pp.  4-5,  17  and  28-29. 

••During  FY  1972-77,  the  Navy  permitted  standards  to  fall  as  a  means  of  meeting  enlistments  goals.  We  suspect  that 
what  has  happened  recently,  is  that  the  Navy  has  set  high  standards  and  has  decided  to  hold  the  line  on  them.  More¬ 
over,  standards  are  probably  so  high  that  any  put  slack  in  recruiter  effort  hu  now  largely  disappeared.  Because  chang¬ 
ing  standards  and  recruiter  effort  no  longer  seem  to  be  available  u  equilibration  mechanisms,  the  Navy  must  look  to 
other  supply  variables  u  a  means  of  reaching  in  enlistment  goals,  or  it  must  lower  those  goals  by  substituting  other 
cohorts  for  male  HSGs. 
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The  study  asks  how  many  HSGs  would  be  generated  by  additional  R  and  A.  We  answer 
this  question  using  regression  analysis  of  quarterly  data  for  3Q  1971  •  4Q  1977.*  Our  metho¬ 
dology  is  unusual  in  adjusting  for  the  effects  of  changes  in  demand  and  in  the  treatment  of 
advertising  which  is  considered  a  capital  investment.  The  study  provides  the  first  estimate  of 
the  productivity  of  Navy  advertising  in  generating  enlistments. 

REGRESSION  MODEL 

Supply  Factors 

We  assume  the  supply  of  HSGs  to  the  Navy  depends  on  the  following  factors:  (1)  HSG 
recruiting  efforts;  (2)  enlistment  standards;  (3)  awareness  of  the  Navy  as  a  career  choice;  (4) 
relative  economic  benefits  of  enlisting;  and  (S)  size  of  the  youth  population.  Measures  of  these 
factors  will  be  used  to  analyze  high  school  graduate  enlistments. 

HSG  Recruiting  Efforts 

Recruiters  have  more  than  a  passive  role;  they  visit  high  schools,  for  instance,  and  give 
talks  on  the  advantages  of  a  Navy  career.  We  suspect  that  differences  in  recruiters'  efforts  have 
had  measurable  effects  on  HSGs.  We  therefore  choose  as  a  factor  the  effort  recruiters  made  to 
enlist  high  school  graduates. 

This  factor  requires  special  handling  because  it  cannot  be  measured  directly:  The  Navy 
collects  no  systematic  data  on  how  much  time  recruiters  spend  in  active  search  of  high  school 
graduates.  We  will  have  to  use  a  number  of  variables  to  measure  the  effects  of  this  factor. 

The  desired  measure— recruiter  man-years  devoted  to  enlisting  high  school  graduates— 
depends  on  the  size  of  the  recruiting  force,  their  overall  effort,  and  the  percentage  of  time 
devoted  to  recruiting  high  school  graduates.  The  total  number  of  personnel  with  a  recruiter  rat¬ 
ing  is  included  as  a  measure  of  the  recruiting  force.  (This  measure  includes  recruiters  who  are 
primarily  administrators.t  While  we  would  have  preferred  to  exclude  administrators,  data  on 
the  number  of  them  are  not  available  for  the  entire  period.) 

To  adjust  for  differences  in  total  effort  per  recruiter,  we  would  prefer  to  use  the  exo¬ 
genously  determined  quota.  Instead,  we  use  actual  enlistments  (E)  because  quarterly  quota 
data  are  unavailable  for  FY  1972.4  Since  the  Navy  generally  makes  goal  during  the  regression 
period— the  shortfall  averages  3.5  percent— total  enlistment  is,  with  a  few  exceptions,  a  close 
approximation  of  the  quota  and,  therefore,  it,  too,  is  exogenously  determined. 

Since  CY  1975,  all  the  Services  have  been  pressured  by  Congress  and  the  Department  of 
Defense  to  increase  the  quality  of  enlistments.  The  Services  responded  by  limiting  mental 


*The  Drift  wu  still  in  effect  during  the  ftret  *i*  quarters  (3Q  1971  •  4Q  1972)  used  to  measure  the  effects  of  R  and  A. 
In  reality,  the  Services  treated  this  as  a  tested  period  for  recruiting  in  an  AVF  environment  R  and  A  were  sharply  in¬ 
creased.  there  were  few  inductions,  and  recruiters  were  pressured  through  high  goals  to  produce  large  numbers  of  en¬ 
listments.  It  seems  appropriate,  therefore,  to  measure  the  effects  of  R  and  A  with  data  from  these  quarters, 
tin  FY  1977,  about  18  percent  of  the  recruiting  force  were  administrators.  These  recruiters  did  not  receive  quotas. 
Administrators  are  all  recruiters  except  production  recruiters  and  recruiters  in  charge  of  stations  having  three  or  less  re¬ 
cruiters. 

SWe  cannot  simply  drop  quarters  in  FY  1972,  because  there  are  too  few  observations  and  most  of  the  variation  in  re¬ 
cruiters  occurs  that  year. 
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group  four  enlistments  and  by  increasing  recruiters’  efforts  devoted  to  enlisting  high  school  gra¬ 
duates.  A  dummy  variable  (NAVPL)  is  included  to  adjust  for  the  change  in  high  school  gradu¬ 
ate  recruiting  effort  starting  in  CY  1975. 

Standards 

We  suspect  that  changes  in  standards  strongly  affected  HSGs.  Since  accurate  data  on  stan¬ 
dards  are  unavailable,  we  are  forced  to  again  use  total  enlistments  as  a  proxy.*  We  assume  that 
standards  for  HSGs  declined  as  the  Navy  increased  its  demand  for  total  enlistments.  HSGs 
increased  as  standards  were  lowered,  because  the  Navy  accepted  more  HSGs  in  the  lower  men¬ 
tal  groups  and  waived  various  entry  requirements  for  them. 

Lacking  data  on  HSG  recruit  effort  and  standards  applying  to  them,  we  use  total  enlist¬ 
ments  as  a  proxy  for  both:  we  assume  that  as  E  increases,  HSG  recruiting  effort  increases  and 
standards  applying  to  HSGs  decline;  these  changes  increase  the  supply  of  HSGs.  We  also  use  a 
dummy  variable  (NAVPL)  as  a  proxy  for  changes  in  HSG  recruiting  effort  not  captured  by  E. 

We  assume  HSG  supply  and  demand  are  in  balance  in  every  quarter  during  the  regression 
period.  As  demand  for  total  enlistments  increase,  there  is  an  increase  in  the  demand  for  HSGs. 
This  is  met  by  increasing  the  supply  of  HSGs  through  increases  in  recruiting  effort  and  reduc¬ 
tions  in  standards. 

Thus,  we  do  not  observe  a  given  supply  function  in  the  sense  that  recruiter  effort  and 
standards  are  unchanging  throughout  the  regression  period.t  Instead,  we  observe  points  on  a 
shifting  supply  function  caused  by  changes  in  these  factors,  and  shifts  are  linked  to  the  Navy’s 
total  demand  for  enlistments.  By  including  E  and  NAVPL  we  adjust  for  such  changes,  and  esti¬ 
mate  a  reduced  form  equation  for  high  school  graduate  enlistments. 

Population’s  Awareness  and  Advertising 

Previous  studies  on  advertising  have  focused  on  measuring  its  effects  in  product  markets: 
the  consensus  is  that  advertising  increases  a  product’s  demand  in  current  and  future  periods, 
but  its  effects  decline  over  time.* 

Therefore,  we  treat  Navy  advertising  as  an  investment  in  a  stock  of  intangible  'awareness 
capital"  which  depreciates  over  time.  This  treatment  permits  its  effects  to  last  for  more  than 
one  period  and  to  decline  over  time.  We  estimate  the  effects  of  awareness  capital  on  HSGs, 
and  given  that,  calculate  the  effect  of  advertising  from  the  relationship  between  it  and  aware¬ 
ness  capital. 

Construction  of  a  capita]  stock  series  for  advertising  requires  that  we  have  data  on  the 
advertising  effort  each  quarter,  as  well  as  the  base  period  stock  and  the  depreciation  rate  of 
awareness  capital.  Only  annual  data  are  available  on  advertising;  lacking  quarterly  data,  we 
assume  the  annual  advertising  budget  is  spent  evenly  over  the  fiscal  year.  We  also  assume  that 


'Another  problem  is  that  standards  are  multi-dimensional.  Even  if  data  were  available  on  the  many  components  of 
standards,  increasing  the  number  of  variables  when  there  are  so  few  data  points  would  reduce  the  statistical  reliability 
of  the  results. 

tHere  our  approach  is  different  from  that  taken  by  most  previous  researchers.  They  have  assumed  recruiters'  efforts 
and  standards  are  unchanging. 

tFor  a  review  of  literature  on  this  issue,  see  Reference  (4). 
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2S0  dollars  in  this  quarter’s  advertising  budget  (in  constant  1967  dollars)  generates  one  unit  of 
awareness  capital  in  the  following  quarter.* 

Because  the  other  data  are  unavailable  we  still  face  serious  measurement  problems,  but 
estimation  procedures  are  developed  to  handle  them.  A  way  is  found  to  express  the  base 
period  stock  as  a  regression  parameter.  This  eliminates  the  need  to  know  it  to  construct  the 
awareness  capital  series.  A  maximum  likelihood  procedure  is  used  to  solve  the  problem  of  not 
knowing  the  depreciation  rate. 

Measurement  of  Awareness  Capital 

Let  K0  be  the  base  period  stock  of  awareness  capital  and  8  its  depreciation  rate  per  quar¬ 
ter.  In  subsequent  periods  the  net  capital  stock  depends  on  Ko,  8,  and  the  stream  of  advertis¬ 
ing  investments  (/,): 

AT,  -  tf0(l  -  8)  +  lo 

K2  -  A0(l  -  8)2  +  /„(1  -  8)  +  /, 

Ki  -  tf0(l  -  8)3  +  /0<i  “  8)2  +  /|0  -  8)  +  /2 


K,  -  Af0(I  -  8)'  +  /0(1  -  «)'-'  +  /,0-  8)'~2  +  . . . 

The  net  capital  stock  in  any  period  t  (except  for  the  base  period)  can  be  written  as  the  sum  of 
two  expressions 

(1)  K,  -  K0X,(8)  +  Y,(8) 
where 

X,(8)  -  (1  -  8)' 

K,( 8)  -  /0(1  -  8)'-'  +  /,(1  -  8)'~2  +  ...  + 

By  defining  Y0  equal  to  zero,  Equation  (i)  also  holds  in  the  base  period. 

Our  approach  is  to  include  both  X,(8)  and  Y,(8)  as  explanatory  variables  in  a  regression 
model  relating  HSGs  to  various  factors.  For  example,  given  the  regression  model 

(2)  dependent  variable  «■...  +  o*K,  +  . . . 
we  substitute  for  K,  to  yield 

(3)  dependent  variable  «■  . . .  +  a*K0X,( 8)  +  a*Y,( 8)  +  . . . 

Along  with  K0,  X,{8)  and  T,(8)  measure  the  stock  of  awareness  capital.  The  coefficient 
of  K,(8)  is  the  effect  on  the  dependent  variable  in  the  regression  model  of  a  one  unit  change  in 
the  awareness  capital  stock. 


*Advertisins  wu  adjusted  for  inflation  usins  an  overall  media  price  index  prepared  by  the-McCann-Erickson  advertising 
agency. 
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The  coefficient  of  X,(6)  is  equal  to  the  coefficient  of  Y,(fi)  times  K0.  Both  variables 
should  have  a  positive  sign.  An  F-test  on  X,(B)  and  Y,( 8)  together  is  used  to  test  whether 
awareness  capital  has  a  statistically  significant  effect  on  HSGs. 

Relative  Economic  Benefits 

Four  factors  are  included  to  measure  changes  in  relative  economic  benefits. 

Pay 

The  first  is  the  ratio  of  military  to  civilian  pay.  Holding  other  factors  constant,  it  is 
expected  to  have  a  positive  effect  on  HSGs.  The  preferred  measure  involves  using  the  ratio  of 
civilian  to  military  discounted  earnings  streams.  However,  assuming  long-run  earnings  growth 
rates  of  civilians  and  enlistees  are  roughly  the  same,  the  measure  reduces  to  simply  the  ratio  of 
military  to  civilian  pay  in  a  base  period.  We  make  this  assumption  and  use  as  our  measure  of 
relative  pay  the  ratio  of  expected  military  to  expected  civilian  earnings  during  an  enlistees  first 
term. 


Civilian  pay  is  measured  by  the  sum  of  four-years’  full-time,  before-tax  earnings  for  a 
typical  18-21  year  old  male  worker.  Similarly,  military  pay  is  measured  by  four-years’  before¬ 
tax  base  pay  for  a  typical  enlistee.  We  exclude  from  military  pay  in-kind  earnings  (quarters  and 
subsistence  allowances  and  their  tax  advantage),  since  they  have  hardly  changed  in  real  terms 
over  the  period. 

Unemployment 

The  second  factor  is  the  unemployment  rate  (UNEMP).  As  it  increases,  expected  civilian 
earnings  decline  and  it  becomes  more  difficult  and  costly  to  find  a  civilian  job.  The  unemploy¬ 
ment  rate  of  16-19  year  old  white  males  is  used  to  measure  unemployment.*  An  increase  in 
the  unemployment  rate  is  expected  to  increase  HSGs. 

GI  Bill  Benefits 

Starting  in  January  1977,  GI  Bill  benefits  were  reduced  and  the  government  no  longer 
paid  for  all  of  a  veteran’s  post-service  education.  Instead,  for  every  dollar  the  individual  pays 
the  government  will  contribute  two.t  This  change  reduced  the  relative  economic  benefits  of 
enlisting.  A  dummy  variable  starting  in  January  1977  (GIBL)  is  included  as  a  factor.  It  is 
expected  to  reduce  HSGs. 

Ending  of  the  Draft 

Although  the  law  authorizing  conscription  expired  on  1  July  1973,  for  all  practical  pur¬ 
poses,  the  Draft  ended  in  January  1973.  By  eliminating  the  risk  of  having  to  serve  in  the 
Army,  ending  the  Draft  decreased  the  benefits  of  enlisting  in  the  Navy.  An  All-Volunteer 
Force  dummy  variable  starting  in  January  1973  (AVF)  is  included.  Its  coefficient  is  expected  to 
have  a  negative  sign. 


•Unemployment  rates  for  other  youth  cohort*  could  have  been  used;  but  because  they  lead  to  move  together  so  doae- 
ly,  we  did  not  experiment  with  other  rates. 
tFor  discussion  see  Reference  [6],  p.  27. 
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Population 

The  eligible  population  (POP)  is  measured  by  the  number  of  civilian  males  ages  18-24. 
(We  could  use  other  cohorts,  such  as  18-19  year  olds,  but  they  move  together  almost  perfectly 
with  the  18-24  year  old  group,  so  it  would  not  matter  statistically  which  one  is  chosen).*  We 
expect  that  an  increase  in  population  would  increase  HSGs. 

Seasonal  Factors 


Seasonal  factors  such  as  the  ending  of  the  school  year  affect  the  quarterly  flow  of  HSGs. 
We  include  dummy  variables  for  first,  second,  and  third  quarters  (Ql,  Q2,  and  Q3)  to  account 
for  these  factors.  We  expect  the  third  quarter  dummy  variable,  which  coincides  with  the  end  of 
the  school  year,  to  have  a  positive  coefficient.  We  have  no  strong  expectations  concerning  the 
signs  of  the  other  seasonal  dummy  variables. 


Specification  of  the  Model 


As  suggested  by  Figure  1,  we  specify  HSGs  and  MG1-3U  HSGs  to  be  nonlinear  functions 
of  total  enlistments  (E)t 

(4)  tf-E'Z  a,z,  ->  bE1  +  cE  +  v 


where  H  -  HSGs  or  MG1-3U  HSGs  and  the  Z,  are  the  following  factors. 


PAY 

UNEMP  - 
POP 

GIBL  - 
NAVPL  - 


AVF 

RECR  - 

JT,(8)  - 

F,(8)  - 


Qi 

Qi 


Four  years'  military  pay  divided  by  four  years’  civilian  pay 
Unemployment  rate  of  white  males,  16-19  year  old 
18-24  year  old  male  population 

Dummy  variable  (equal  to  one  for  quarters  starting  in  1977, 
zero  otherwise)  measuring  effects  of  declines  in  GI  Bill  benefits 

Dummy  variable  (equal  to  one  for  quarters  starting  in  197$, 
zero  otherwise)  measuring  effects  of  a  Navy  policy  which 
increased  HSG  recruiting  efforts 

Dummy  variable  (equal  to  one  for  quarters  starting  in  1973, 
zero  otherwise)  measuring  effects  of  the  ending  of  the  Draft 

Number  of  recruiters 

(1  —  8)'  for  8s  between  S  and  25  percent 

/0(1  -  8)'-'  +  /,  (1  -  8)'“J  +...  +  /f_i  where  I,  is  the 
advertising  budget  adjusted  for  inflation  in  rand  8s  are  between 
$  and  25  percent 

Dummy  variable  (equal  to  one  in  first  calendar  quarter,  zero 
otherwise)  measuring  seasonality  of  enlistments 

Dummy  variable  (equal  to  one  in  second  calendar  quarter,  zero 
otherwise)  measuring  seasonality  of  enlistments 


•The  11-19  year  oM  group  equaled  30.9  perceot  ±  0.7  peraant  of  the  11-24  pear  oM  cohort  over  tbt  porM  1971-1974. 
tBy  using  £  instead  of  quotas  we  still  estimate  Figure  1.  It  is  shifted  eight ty  to  the  toft,  however,  hsceuss  £wes  9tS 
percent  of  quotas  during  the  regression  period.  HSGe  aad  £are  nonprior  service  males  easMIag  reoorvww. 
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Qi 


Dummy  variable  (equal  to  one  in  the  third  calendar  quarter, 
zero  otherwise)  measuring  seasonality  of  enlistments 

Constant  term 


v 


Error  term. 


This  functional  form  is  chosen  because  it  is  consistent  with  two  assumptions  about  the 
underlying  mechanism.  The  first  is  that  the  contribution  of  each  Z ,  to  the  number  of  HSGs 
should  depend  on  recruiter  effort  and  HSG  recruiting  standards,  represented  by  the  proxy  E. 
The  equation  conforms  in  this  respect:  the  coefficient  of  Zt  is  Ea,.  In  terms  of  changes  in  H, 
which  we  may  want  to  bring  about  through  policy: 

(5)  If"** 

The  second  assumption  is  that  the  contribution  of  recruiter  effort  and  standards  should  show 
decreasing  returns.  That  is,  if  recruiters  are  already  aggressive  and  standards  are  already  low, 
further  changes  of  this  nature  should  exhibit  diminishing  returns.  The  equation  conforms 
because  of  the  negative  coefficient  of  E 

(6)  |f  “  £  a,Z,  -  IbE  +  c. 


Estimation  Procedure 

It  might  be  inappropriate  to  apply  equation  (4)  to  the  data  directly.t  One  of  the  assump¬ 
tions  of  "ordinary  least  squares”  regression  is  that  the  error  term  does  not  depend  systematically 
on  any  of  the  variables.  In  the  present  case,  however,  the  error  term  seems  to  depend  on  E 
this  might  lead  us  to  overestimate  the  variance  of  parameters.  To  remove  the  problem,  we 
divide  equation  (4)  by  EA 

For  this  reason,  the  regression  equation  to  be  estimated  is: 

(7)  ——  “  V  a,Z,  —  bE  +  c  +  fi 

E  7” 

where  the  error  term  n  equals  ?/£§ 

Once  the  coefficients  a,  are  obtained,  equation  (S)  is  used  for  estimating  R  and  A  produc¬ 
tivities. 

A  search  procedure  is  used  to  obtain  maximum  likelihood  estimates.  Adjusting  for  auto¬ 
correlation  using  the  Cochrane-Orcutt  technique,**  regression  models  for  HSGs  and  MG1-3U 
HSGs  are  estimated  assuming  alternative  awareness  capital  depreciation  rates  per  quarter  within 


•This  implies  constant  marginal  effects.  We  tested  this  assumption  for  recruiters. 

tData  were  obtained  from  the  Navy  Recruiting  Command,  the  Defense  Manpower  Data  Center,  the  McCenn-Erickson 
advertising  agency,  the  Urban  Institute,  and  published  sources.  For  details,  see  Reference  (91,  Appendix  A. 
tThe  problem  is  technically  one  of  "heteroscedastidty.*  For  more  discussion,  see  Reference  117),  pp.  259-61 
(Equation  (4)  was  also  estimated  for  HSGs.  Result  change  slightly  .when  equation  (7)  is  used  instead  of  equation  (4). 
While  the  pattern  is  the  same,  some  coefficients  obtained  with  equation  (4)  are  about  20  percent  larger  in  absolute 
value.  The  greatest  difference  is  in  the  pay  coefficient  which  is  SO  percent  larger. 

"For  discussion,  see  Reference  (17),  pp.  27744. 
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the  relevant  range,  e.g.,  in  percents  5,  6,  ...  ,  25.  For  both  HSGs  and  MG1-3U  HSGs,  the 
models  that  maximized  the  likelihood  function  are  those  for  which  the  depreciation  rate  is  11 
percent.*  These  best-fit  models  yield  the  maximum  likelihood  estimates  of  parameters  includ¬ 
ing  8.t 

RESULTS 
Regression  Findings 

Maximum  likelihood  estimates  and  their  t- values  are  reported  in  Table  1.  A  number  of 
factors  have  statistically  significant  effects  on  high  school  graduate  enlistments.  For  all  HSGs, 
total  enlistments  and  the  third  quarter  dummy  variable  are  significant  at  the  one  percent  level. 
The  unemployment  rate,  the  number  of  recruiters,  and  awareness  capital,  i.e.,  X(ll)  and 
Till)  together,  are  significant  at  the  five  percent  level.  Population  and  dummy  variables  GIBL 
and  NAVPL  are  significant  at  the  ten  percent  level.  PAY,  AVF,  Qu  and  Q2  are  not  statistically 
significant.*  The  results  for  MG1-3U  HSGs  are  similar  to  those  for  all  HSGs. 

TABLE  I  —  Regression  Findings 


Independent 

variables 

%  HSG 

t 

%  MG1-3U 

t 

CONSTANT 

-7.99 

-1.60* 

™-TT« 

PAY 

0.655 

1.29 

0.120 

0.28 

UNEMP 

0.0299 

2.64* 

0.0219 

2.02* 

POP 

0.00051$ 

1.63° 

0.000217 

0.84 

E 

-0.0000108 

2.94f 

-0.0000105 

3.30f 

RECR 

0.000244 

2.34* 

0.000215 

2.50* 

X(ll) 

2.01 

1.99* 

1.18 

1.44" 

Y(ll) 

0.00000231 

1.32* 

0.000000852 

0.58* 

GIBL 

0.0453 

1.57" 

0.00306 

0.10 

NAVPL 

0.0972 

1.68" 

0.153 

2.92c 

AVF 

-0.0430 

-0.54 

0.0273 

0.38 

Q\ 

-0.0307 

-1.26 

-0.0180 

-1.02 

Qi 

0.0305 

1.56 

-0.0144 

0.10. 

Cl 

0.157 

3.10° 

0.105 

2.52* 

Jr 

0.899 

N.A. 

0.905 

N.A. 

^<13.  ID 

17.4f 

N.A. 

18.6f 

N.A. 

D-W 

2.63 

N.A. 

2.15 

N.A. 

Rho 

-0.469 

2.65* 

-0.191 

-0.97 

^Statistically  significant  at  ten  percent  level. 

^Statistically  significant  at  five  percent  level. 

'Statistically  significant  at  one  percent  ievel. 

*F-test  of  X(ll)  and  Y(ll)  together  indicate*  statistical  significance  at  five  peroeat  level 
for  HSGs  and  ten  percent  level  for  MG1-3U  HSGs. 


*To  derive  the  likelihood  function,  it  is  assumed  that,  after  adjusting  for  autoeorreiatioa,  error  terms  of  the  regression 
model  given  by  equation  (7)  are  Independent  and  normally  distributed,  having  a  rare  mean  and  constant  variance  in 
each  quarter. 

tFor  more  diacusaion  on  the  estimation  procedure,  see  [1(1  and  [171.  , 

♦The  result  for  PAY  Is  probably  due  to  there  having  been  ttttis  Independent  variation:  most  of  the  variation  eoaatoed 
of  a  slight  decline  toward  the  end  of  the  regression  period  when  the  Navy  pm  a  greater  imphaala  on  recntitlng  HSOa. 
Thus,  the  negative  effects  of  relatively  small  declines  in  PAY  were  probably  overwhelmed  by  the  positive  affects  of 
Changes  in  other  variable*,  e.g.,  recruiting  effort. 
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The  high  ujjusted  R2s  indicate  that  the  models  explain  most  of  the  variation  in  the  depen¬ 
dent  variables.  According  to  the  F-statistics,  the  models  are  statistically  significant  at  the  one 
percent  level.  The  t-statistics  for  Rho  suggest  that  adjustments  for  autocorrelation  are  perhaps 
necessary  only  in  the  regression  model  for  all  HSGs. 

Although  awareness  capital  is  statistically  significant,  its  impact  measured  by  the 
coefficient  of  Y(U)  is  imprecise  *  This  may  be  caused  by  high  collinearity  between  X(ll)  and 
Y(ll)  (the  simple  correlation  coefficient  between  them  is  -0.95). 

Except  for  GIBL,  all  variables  have  the  expected  sign.t  The  effect  is  positive  for  PAY, 
unemployment,  population,  NAVPL,  recruiters,  X(ll),  Y(ll)  and  Q3,  and  negative  for  total 
enlistments  and  AVF.t 

Test  of  the  Model 

In  measuring  the  effects  of  R  and  A,  we  assume  that  advertising  operates  through  a  stock 
of  awareness  capital  having  an  11  percent  depreciation  rate,  and  that  recruiters  have  an  immedi¬ 
ate,  constant  marginal  effect.  We  also  estimated  the  model  assuming  other  treatments  of  R  and 
A ,  which  yielded  evidence  supporting  our  approach  and  findings. 

Treating  advertising  as  a  stock  rather  than  a  flow  improves  the  fit  of  equation  (7).  Chang¬ 
ing  the  depreciation  rate  reinforces  the  finding  that  the  effect  of  advertising  is  measured  impre¬ 
cisely;  however,  it  has  little  effect  on  the  recruiter's  coefficient  or  significance  level.  We  also 
experimented  with  models  that  included  lagged  recruiters  and  current  recruiters  squared.  Only 
current  recruiters  seem  to  be  important,  and  their  marginal  productivity  appears  to  be  constant 
over  the  range  of  recruiting  force  levels  observed. § 

To  check  the  stability  of  the  model,  we  predicted  HSGs  in  CY  1978  using  actual  levels  of 
explanatory  variables.  Predictions  are  given  in  Table  2.  The  model  does  well:  it  predicts  the 
sharp  decline  in  HSGs  with  only  a  2.1  percent  error  for  the  year.  These  results  increase  our 
confidence  in  the  model  and  the  implied  estimates  of  marginal  products  for  R  and  A. 

The  model  was  estimated  with  data  from  a  period  when  HSGs  were  greatly  affected  by 
demand  considerations,  probably  through  changes  in  standards.  The  model  essentially  uses 
goals  as  a  proxy  for  standards,  and  in  CY  1978  goals  were  sharply  lowered.  The  predictive 


’Imprecise  estimate*  of  X(ll)  sod  Y(ll)  may  explain  a  peculiar  reautt  obtained  for  the  estimate  of  the  base  pariod  cap¬ 
ital  stock  (Xg).  It  is  calculated  by  the  ratio  of  X(ll)  and  Y(ll)  coefficients.  For  HSGs  the  astimata  of  Kt  ia  about  170 
thousand  units  of  awareness  capital.  Although  the  estimate  is  positive,  as  expected,  it  seems  too  high.  Given  a  depre¬ 
ciation  rat*  of  11  percent  per  quarter,  to  obtain  such  a  high  steady-state  level  would  require  advertising  to  be  about  S24 
million  per  year.  Prior  to  FY  1972,  however,  advertising  was  at  much  lower  levels.  Sine*  we  use  the  ratio  of  imprecise 
estimate*  to  calculate  A*,  its  estimate  is  also  likely  to  be  imprecise.  Perhaps  this  explaint  the  high  estimate  of  Kt  ob¬ 
tained. 

tTbs  unexpected  sign  for  GIBL  is  probably  caused  by  the  fact  that  a  relatively  large  number  of  contracts  wen  signed 
just  prior  to  reduction  of  O.l.  Bill  benefits  in  the  first  quarter  of  CY  1977.  This  cawed  a  tpffiover  of  HSOs  in  1977, 
which  would  tend  to  reverse  the  Mgn  for  the  G.I.  Bill  quartan  -  IQ  1977  -  4Q  1977. 
tTbe  sign  of  AVF  changes  direction  for  MG1-3U  HSGs,  but  it  is  not  statisticafiy  significant  in  either  medal. 

(For  details,  see  Reference  19],  Appendix  B. 


NAVAL  RESEARCH  LOOBTKS  QUARTERLY 


VOL.  29,  NO.  2,  IUNE  1912 


RECRUITERS.  ADVERTISING.  AND  NAVY  ENLISTMENTS 


395 


TABLE  2  -  HSG  Predictions  for  CY 1978 


Quarter 

Predictions 

Actuals 

Predictions 
—  actuals 

Percent 

errors 

1 

9,950 

9,880 

70 

0.7 

2 

10,440 

10,989 

-549 

-5.0 

3 

17,731 

18,344 

-613 

-3.3 

4 

9,631 

9,571 

60 

0.6 

Year 

47,752 

48,784 

-1,032 

2.1 

accuracy  of  the  model  suggests  that  the  low  level  of  HSGs  achieved  in  CY  1978  was  partly  due 
to  increases  in  standards,  and  that  HSGs  might  be  increased  in  the  future  by  lowering  entry  res¬ 
trictions. 

Marginal  Products  of  Recruiters  and  Advertising 

Marginal  products  (MPs)  and  elasticities  for  CY  1978  are  given  in  Table  3.  An  additional 
recruiter  would  yield  14.0  HSGs  of  which  12.3  would  be  MGl-3Us.  Thus,  most  of  the 
recruiters'  impact  is  on  HSGs  in  the  upper  mental  groups.  These  enlistments  are  generated 
immediately,  based  on  our  experiments  with  other  models  including  lagged  recruiters. 


TABLE  3  —  Effects  of  Recruiters  and  Advertising  in  CY  1978 “ 


Supply  factors 

All  HSGs  (48,784) 

MG1-3U  HSGs  (29,844)* 

MP 

MP 

Elasticity 

One  recruiter 

mm 

■■■ 

All  (3425) 

12.3 

1.42 

Except  administrators  (2808) 

| 

12.3 

1.16 

$1M  advertising' 

I 

($10  million) 

WWM 

0.25 

445 

0.15 

•Numbers  in  parentheses  are  CY  1978  levels  of  variables  which  are  used  to  calculate  elasticities. 
‘Estimated  from  ratio  of  MG1-3U  HSGs  to  all  HSGs  in  first  three  quarters  of  CY  1978,  e.».,  0.61. 
'Effect  of  $1  million  of  advertisint. 


The  recruiters  elasticity  is  0.98  for  HSGs,  but  if  we  exclude  administrators  it  is  only  0.80. 
The  lower  elasticity  is  probably  more  accurate.  This  is  because  the  regression  model  yields  an 
estimate  for  the  type  of  recruiter  added  during  the  regression  period,  and  we  suspect  that  few  of 
those  added  were  administrators. 

A  once-and-for-all  increase  in  the  advertising  budget  of  $1  million  would  yield  1,206 
HSGs  in  the  long-run.  Only  445  or  37  percent  would  be  in  the  upper  mental  groups,  however* 

The  effects  of  advertising  are  felt  mostly  in  the  future.  Based  on  the  estimated  depreda¬ 
tion  rate  of  1 1  percent  per  quarter,  we  calculate  that  only  30  percent  of  the  total  impact  is  felt 
this  year;  it  takes  four  years  for  82  percent  to  accrue. 


•For  details  on  how  the  effects  of  advettMns  are  calculated,  tea  Reference  [91,  Appendix  C. 
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Thus,  R  and  A  both  affect  HSGs,  but  the  effects  differ  in  two  important  respects  which 
make  advertising  less  attractive:  unlike  recruiters,  advertising  predominately  affects  HSGs  in  the 
lower  mental  groups;  and,  rather  than  this  year,  most  of  its  effects  are  felt  in  the  future. 

Recruiter  and  Advertising  Cost  Per  HSG 

Given  marginal  productivities  and  estimates  of  marginal  costs,  we  calculate  the  cost  of 
generating  HSGs  with  R  and  A.  These  are  given  in  Table  4.  As  discussed  earlier,  a  SI  million 
increase  in  advertising  in  1967  dollars  would  yield  1,206  HSGs  in  the  long-run.  Given  the 
change  in  advertising  costs  since  1967,  in.CY  1978  it  would  take  $2.2  million  of  advertising  to 
produce  what  $1  million  could  buy  in  1967.  The  ratio  of  $2.2  million  to  1,206  HSGs  or  $1,824 
is  advertising's  marginal  cost  for  HSGs.  The  cost  per  MG1-3U  HSG  is  $4,944. 

The  cost  of  an  additional  recruiter  in  CY  1978  was  about  $31,000.  Given  the  recruiter 
productivities  in  Table  3,  the  marginal  cost  of  enlistments  is  $2,214  for  HSGs  and  $2,520  for 
MG1-3U  HSGs. 

Although  in  the  long  run,  the  cost  per  HSG  is  slightly  lower  for  advertising,  in  the  short 
run  it  is  more  costly  than  recruiters.  This  is  because  only  30  percent  of  the  HSGs  generated  by 
advertising  accrue  in  the  first  year.  As  a  result,  the  cost  per  HSG  generated  in  the  first  year  is 
about  three  times  higher  than  recruiters.  To  calculate  it,  we  divide  $2.2  million  by  the  HSGs 
generated  by  advertising  in  the  first  year  (362)  to  yield  a  cost  of  $6,077  per  HSG. 


TABLE  4  -  Long-Run  Cost  Per  HSG  in  CY  1978  for 
One  Recruiter  and  Si  Million  of  Advertising 


$1  Mi 

llion  advertising 

One  recruiter 

Group 

Marginal 

Marginal 

cost* 

Cost 

Marginal 

Marginal 

Cost 

productivity 

(millions) 

per  HSG 

productivity 

cost6 

per  HSG 

All 

HSGs 

1,206 

$2.2 

$1,824 

14.0 

$31,000 

$2,214 

MG1-3U 

HSGs 

445 

$2.2 

$4,944 

12.3 

$31,000 

$2,520 

*The  marginal  coat  of  SI  million  of  advertising  equals  the  advertising  price  index  times  SI  million.  The  price  index  is 
forecast  to  be  2.2  in  CY  197S,  based  on  the  1977  price  level  (2.02)  adjusted  upward  by  the  1977  rate  of  inflation  (9  per¬ 
cent). 

‘Marginal  cost  includes:  SI9.000  for  recruiter's  salary,  allowances,  retirement,  training,  reenlistment  bonus,  and  special 
pay;  and  SI  2,000  for  support  costs  including  salaries  and  operations  and  maintenance  expenses. 

In  summary,  the  long-run  cost  of  generating  HSGs  is  about  the  same  for  R  and  A  — 
$2,000  per  HSG.  For  MG1-3U  HSGs  in  the  long  or  short  run  and  for  all  HSGs  in  the  short 
run,  it  is  less  costly  to  use  recruiters. 

CONCLUSIONS 

Our  findings  suggest  that  R  and  A  could  be  used  to  increase  high  school  graduate  enlist¬ 
ments.  But,  since  advertising  affects  HSGs  mostly  in  the  lower  mental  groups,  the  mix  of  addi¬ 
tional  R  and  A  should  be  weighted  more  heavily  towards  recruiters. 
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With  regard  to  temporary  shortfalls,  additional  resources  are  needed  that  would  have  an 
immediate  effect.  Since  about  half  of  the  HSGs  generated  by  advertising  accrue  after  the 
second  year,  the  proper  strategy  is  to  add  just  recruiters. 

Changes  in  the  unemployment  rate  have  an  important  effect  on  HSGs:  a  one  percentage 
point  increase  in  the  youth  unemployment  rate  would  yield  about  1,700  HSGs  and  12S0  would 
be  in  the  upper  mental  groups;  elasticities  are  0.46  for  HSGs  and  O.SS  for  MG1-3U  HSGs. 
Because  of  its  impact  and  tendency  to  change  frequently,  unemployment  should  be  taken  into 
account  when  formulating  near-term  R  and  A  requirements. 

The  recruiting  difficulty  of  the  Navy  in  FY  1978-79  was  probably  caused  by  declines  in 
unemployment  and  relative  pay;  another  factor  might  have  been  increases  in  standards.  Still, 
the  shortfall  of  HSGs  has  been  a  modest  10  percent.  It  can  probably  be  met  with  just  modest 
increases  in  R  and  A  and  slight  reductions  in  standards. 

In  the  1980s,  however,  the  shortfall  would  be  larger  than  10  percent.  There  will  be  a 
reduction  in  the  youth  population  which  will  not  only  reduce  the  eligible  population,  it  is  also 
likely  to  lead  to  lower  unemployment  rates  and  higher  civilian  wages.  These  changes  would 
further  reduce  the  supply  of  HSGs.  Therefore,  future  shortfalls  are  likely  to  be  larger,  and  to 
meet  them  may  require  increases  in  military  pay  and  benefits  in  addition  to  the  remedies  previ¬ 
ously  suggested. 

Regarding  methodology,  measurement  of  the  productivity  of  advertising  is  difficult 
because  (1)  advertising  seems  to  have  only  slight  effects  which  are  distributed  over  time,  and 
(2)  HSGs  are  strongly  affected  by  other  supply  factors  as  well  as  by  demand  considerations. 
These  problems  were  overcome  by  treating  advertising  as  an  investment  in  awareness  capital, 
and  by  including  it  and  numerous  other  factors  in  a  reduced  form  regression  model. 

While  our  methodology  is  interesting,  the  data  are  extremely  crude  and  estimates  are 
based  on  only  26  observations.  Therefore,  results  and  policy  recommendations  should  be 
viewed  as  highly  tentative.  More  research  is  needed  to  provide  a  solid  basis  for  R  and  A  policy 
making. 
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